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A NOVEL HUMAN DOPAMINE RECEPTOR AND ITS USES 

BACKGROUND OF THE INVENTION 
This application is a continuation-in-part of U.S. Patent Application Serial 
No. 07/626,618, filed on December 7, 1990, which is hereby incorporated by 
5 reference. 

This invention was made with government support under NIMH grant 
MH-45614 awarded by the National Institutes of Health, Unites States of 
America, and grant PG 11121 awarded by the Medical Research Council of 
Canada. The governments have certain rights in the invention. 

10 

1. Field of the Invention 

The invention relates to dopamine receptors from mammalian species and 
the genes corresponding to such receptors. In particular, it relates to the human 
dopamine receptor D4. Specifically, the invention relates to the isolation, cloning 

15 and sequencing of the human D4 receptor gene. The invention also relates to the 
construction of eukaryotic expression vectors capable of expression of the human 
D4 dopamine receptor in cultures of transformed eukaryotic cells and the 
synthesis of the human D4 dopamine receptor in such cultures. The invention 
relates to the use of such cultures of transformed eukaryotic cells producing the 

20 human D4 dopamine receptor for the characterization of antipsychotic drugs. 

2. Information Disclosure Statement 

Dopamine is a neurotransmitter that participates in a variety of different 
functions mediated by the nervous system, including vision, movement, and 

25 behavior (see generally Cooper et d., 1978, The flrehemicd frKlS of 
Neuropharmacology . 3d ed., Oxford University Press, New York, pp. 161-195). 
The diverse physiological actions of dopamine are in turn mediated by its 
interaction with two of the basic types of G protein-coupled receptors, Dl and 
D2, which respectively stimulate and inhibit the enzyme adenylyl cyclase 

30 (Kebabian & Calne, 1979, Nature 222: 93-96). Alterations in the number or 
activity of these receptors may be a contributory factor in disease states such as 
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Parkinson's disease (a movement disorder) and schizophrenia (a behavioral 
disorder). 

A great deal of information has accumulated on the biochemistry of the D 1 » 
and D2 dopamine receptors, and methods have been developed to solubilize and 

5 purify these receptor proteins (see Senogles et al., 1986, Biochemistry 2£: 749- * 
753; Sengoles et al., 1988, J. Biol. Chem. 2SJ: 18996-19002; Gingrich et al., 
1988, Biochemistry 21- 3907-3912). The Dl dopamine receptor in several tissues 
appears to be a glycosylated membrane protein of about 72 kD (Amlaiky et al., 
1987, Mol. Pharmacol. 21: 129-134; Ninik et al., 1988, Biochemistry 22: 7594- 

10 7599). The D2 receptor has been suggested to have a higher molecular weight 
of about 90 - 150 kD (Amlaiky & Caron, 1985, J. Biol. Chem. 26Q: 1983-1986; 
Amlaiky & Caron, 1986, J. Neurochem. 4J: 196-204; Jarvie et al., 1988, Mol. 
Pharmacol. 24: 91-97). Much less is known about a recently discovered 
additional dopamine receptor, termed D3 (Sokoloff etal. , 1990, Nature 347:146- 

15 151). 

Dopamine receptors are primary targets in the clinical treatment of 
psychomotor disorders such as Parkinson's disease and affective disorders such 
as schizophrenia (Seeman et al., 1987, Neuropsychopharm. J.: 5-15; Seeman, 
1987, Synapse 1: 152-333). The three different dopamine receptors (D 1 , D2, D3) 

20 have been cloned as a result of nucleotide sequence homology which exists 
between these receptor genes (Bunzow et al. , 1988, Nature 23i: 783-787; Grandy 
et al., 1989, Proc. Natl. Acad. Sci. USA gfi: 9762-9766; Dal Toso et al., 1989, 
EMBO J. fi: 4025-4034; Zhou et al., 1990, Nature 3ifr 76-80; Sunahara et al., 
1990, Nature 24£: 80-83; Sokoloff et al., 1990, Nature 2*2: 146-151). 

25 The antipsychotic clozapine is useful for socially withdrawn and treatment- 

resistant schizophrenics (see Kane et al., 1990, Nature 347: 146-151), but unlike 
other antipsychotic drugs, clozapine does not cause tardive dyskinesia (see Casey, 
1989, Psychopharmacology 22: 547-553). Clozapine, however, has dissociation 
constants for D2 and D3 which are 3 to 30-fold higher than the therapeutic free f 

30 concentration of clozapine in plasma water (Ackenheil et al., 1976, Arzneim- 
Forsch 2fi: 1156-1158; Sandoz Canada, Inc., 1990, Clozaril: Summary of 
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preclinical and clinical data). This suggests the existence of dopamine receptors 
more sensitive to the antipsychotic clozapine than those known in the prior art 
heretofore. 

We have cloned and sequenced such a human dopamine receptor which we 
5 term D4, The dopamine D4 receptor gene has high homology to the human 
dopamine D2 and D3 receptor genes. The pharmacological profile of this 
receptor resembles that of the D2 and D3 receptors but it has an affinity for 
clozapine which is tenfold higher. The present inventors envision that the D4 
dopamine receptor disclosed as this invention may prove useful in discovering 
10 new types of drugs for schizophrenia that like clozapine do not induce tardive 
dyskinesia and other motor side effects. 

We have also discovered that the D4 gene is polymorphic in the human 
population, having at least 7 different alleles that can be detected by restriction 
fragment length polymorphism analysis (see, Botstdn ex o/ M 1980, Am. J. Hum. 
15 Genet. 22: 314-331). This is the first receptor in the catecholamine receptor 
family which displays polymorphic variations in the human population. The 
observed polymorphism in dopamine D4 receptor genes may underlie individual 
differences in susceptibility to neuropsychiatric disorders such as schizophrenia 
and manic depression, as well as responsiveness to antipsychotic medication. 

20 

SUMMARY OF THE INVENTION 
The present invention is directed toward the isolation, characterization and 
pharmacological use of the human D4 dopamine receptor, the gene corresponding 
to this receptor, a recombinant eukaryotic expression construct capable of 
25 expressing the human D4 dopamine receptor in cultures of transformed eukaryotic 
cells and such cultures of transformed eukaryotic cells that synthesize the human 
D4 dopamine receptor. 

It is an object of the invention to provide a nucleotide sequence encoding 
a mammalian dopamine receptor. Further, it is an object of the invention to 
30 provide a nucleotide sequence that encodes a mammalian dopamine receptor with 
novel and distinct pharmacological properties. It is specifically an object of the 
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invention to provide a nucleotide sequence encoding a mammalian dopamine 
receptor having the particular drug dissociation properties of the human dopamine 
receptor D4. In particular, the mammalian dopamine receptor encoded by the 
nucleotide sequence of the present invention has a high affinity for the drug 
5 clozapine. The human D4 dopamine receptor embodied in the present invention 
shows a dissociation constant (termed Kj) of 1-40 nanomolar (nM), preferably 1- 
20 nM, most preferably 11 nM clozapine, as detected by the pH]spiperone 
binding assay disclosed herein. The human D4 dopamine receptor embodied in 
the present invention displays the following pharmacological profile of inhibition 
10 of pH]spiperone binding in the [ 3 H]spiperone binding assay: spiperone > 
eticlopride > clozapine > (+)-butaclamol > raclopride > SCH23390. In a 
preferred embodiment of the invention, the nucleotide sequence encoding a 
dopamine receptor encodes the human dopamine receptor D4. 

The present invention provides a nucleotide sequence encoding a 
15 mammalian dopamine receptor that is the human D4 receptor. In a preferred 
embodiment, this nucleotide sequence comprises a cDNA sequence isolated from 
RNA derived from the human neuroblastoma cell line SK-N-MC [SEQ ID No: 
17], comprising the sequences of the D4.2 allele of the human D4 dopamine 
receptor gene. In another preferred embodiment, this nucleotide sequence 
20 comprises a cDNA sequence isolated from RNA derived from human pituitary 
gland tissue [SEQ ID No: 19]. In yet another preferred embodiment, this 
nucleotide sequence comprises a cDNA sequence isolated from RNA derived from 
human substantia nigra tissue [SEQ ID No.: 19]. Both of these embodiments 
comprise the sequences of the D4.4 allele of the human D4 dopamine receptor 
25 gene. 

The invention also includes a nucleotide sequence derived from human 
genomic DNA [SEQ ID Nos.: 1,3,4,5,7,12,14 & 15] comprising the sequences 
of the D4.7 allele of the human D4 dopamine receptor gene, and a nucleotide 
sequence derived from human genomic DNA [SEQ ID Nos.: 1,3,4,5,7,10,14 & 
30 15] comprising the sequences of the D4.4 allele of the human D4 dopamine 
receptor gene. In this embodiment of the invention, the nucleotide sequence 
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includes 5 kilobases (kb) of human genomic DNA encoding the dopamine receptor 
D4. This embodiment includes the sequences present in the cDNA embodiments 
as well as nucleotide sequences of 5' untranslated sequence, three intervening 
sequences that interrupt the coding sequence of the human D4 dopamine receptor 

5 gene, and 3' untranslated sequences. Also provided is a cDNA sequence derived 
from the genomic DNA sequence of the D4.4. allele [SEQ ID No: 19] and the 
D4.7 allele [SEQ ID No: 21] of the human D4 dopamine receptor gene. 

The invention includes a nucleotide sequence of a human D4 receptor 
molecule, and includes allelic variations of this nucleotide sequence and the 

10 corresponding D4 receptor molecule, either naturally occurring or the product of 
in vitro chemical or genetic modification, having essentially the same nucleotide 
sequence as the nucleotide sequence of the human D4 receptor disclosed herein, 
wherein the resulting human D4 receptor molecule has substantially the same drug 
dissociation properties of the human D4 receptor molecule corresponding to the 

15 nucleotide sequence described herein. Specific preferred embodiments include 
alleles D4.2, D4.4 and D4.7 of the human D4 dopamine receptor gene, as defined 
herein. 

The invention provides sequences of the naturally-occurring alleles of the 
human D4 dopamine receptor gene. Such alleles are defined as comprising from 

20 about 2 to about 8 repeats of a nucleotide sequence that is substantially 
homologous to the sequence [SEQ ID Nos: 8, 10, 12, 17, 19,21]: 
5'-A CCC GCG CCC CGC CTC CCC CAG GAG CCC TGC GGC CCC GAC 
TGTGCGCC-3'. 

Allelic variations of this nucleotide sequence and the corresponding D4 

25 receptor molecule, either naturally occurring or the product of in vitro chemical 
or genetic modification, having essentially the same nucleotide sequence as the 
nucleotide sequence of the human D4 receptor disclosed herein, wherein the 
resulting human D4 receptor molecule has substantially the same.drug dissociation 
properties of the human D4 receptor molecule corresponding to the nucleotide 

30 sequence described herein are additional preferred embodiments of the invention. 
Specific preferred embodiments include the allele D4.2, comprising 2 copies of 
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the repeat tandemly repeated [SEQ ID Nos: 8 & 17]; the allele D4.4, comprising 
4 copies of the repeat tandemly repeated [SEQ ID Nos: 10 & 19]; and the allele 
D4.7, comprising 7 copies of the repeat tandemly repeated [SEQ ID Nos: 12 & 
21]. 

5 The invention also includes a predicted amino acid sequence for the human 

D4 dopamine receptor deduced from the nucleotide sequence comprising the 
complete coding sequence of the D4 dopamine receptor gene [SEQ ID Nos: 18, 
20 & 22]. Specific preferred embodiments comprise the amino acid sequence of 
the naturally-occurring alleles of the human D4 dopamine receptor gene. Such 

10 alleles are defined as comprising from about 2 to about 8 repeats of an amino acid 
sequence that is substantially homologous to the sequence [SEQ ID Nos: 
9,11,13,18,20,22]: 

(P/A)AP(R/G)LP(Q/R/P)(D/G)PCG(P/S)(D/N)CAP 
Allelic variations of this amino acid and the corresponding D4 receptor 

15 molecule, either naturally occurring or the product of in vitro chemical or genetic 
modification, having essentially the same amino acid sequence as the human D4 
receptor disclosed herein, wherein the human D4 receptor molecule has 
substantially the same drug dissociation properties of the human D4 receptor 
molecule corresponding to the amino acid sequence described herein are additional 

20 preferred embodiments of the invention. Specific preferred embodiments include 
the allele D4.2, comprising 2 copies of the repeat tandemly repeated [SEQ ID 
Nos: 9 & 18]; the allele D4.4, comprising 4 copies of the repeat tandemly 
repeated [SEQ ID Nos: 1 1 & 20]; and the allele D4.7, comprising 7 copies of the 
repeat tandemly repeated [SEQ ID Nos: 13 & 22]. 

25 This invention provides both nucleotide and amino acid probes derived 

from these sequences. The invention includes probes isolated from either cDNA 
or genomic DNA clones, as well as probes made synthetically with the sequence 
information derived therefrom. The invention specifically includes but is not 
limited to oligonucleotide, nick-translated, random primed, or in vitro amplified 

30 probes made using cDNA or genomic clones embodying the invention, and 
oligonucleotide and other synthetic probes synthesized chemically using the 
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nucleotide sequence information of cDNA or genomic clone embodiments of the 
invention. The sequence information provided by the present invention is also 
intended to provide the basis for in vitro amplification methods for detecting D4 
dopamine receptor alleles comprising the genotype of somatic and germ cells, 

5 zygotes, embryoes, and tissues in humans and other mammals for diagnostic, 
therapeutic and other purposes. 

It is a further object of this invention to provide sequences of the human 
D4 dopamine receptor for use as probes to determine the pattern, amount and 
extent of expression of this receptor in various tissues of mammals, including 

10 humans. It is also an object of the present invention to provide probes derived 
from the sequences of the human D4 dopamine receptor to be used for the 
detection and diagnosis of genetic diseases. It is an object of this invention to 
provide probes derived from the sequences of the human D4 dopamine receptor 
to be used for the detection of novel related receptor genes. 

15 The present invention also includes synthetic peptides made using the 

nucleotide sequence information comprising the cDNA or genomic clone 
embodiments of the invention. The invention includes either naturally occurring 
or synthetic peptides which may be used as antigens for the production of D4 
dopamine receptor-specific antibodies, or used for competitors of the D4 receptor 

20 molecule for drug binding, or to be used for the production of inhibitors (or 
blockers) of the binding of dopamine or dopamine analogs of the D4 dopamine 
receptor molecule. As used herein, the term "inhibitor of dopamine binding" is 
intended to encompass biochemical agonists and/or antagonists of dopamine 
binding to the D4 dopamine receptor. 

25 In addition, this invention includes recombinant DNA constructs 

comprising the human D4 dopamine receptor and sequences that mediate the 
replication and selected growth of microorganisms that carry this construct. 

The present invention provides recombinant expression constructs 
comprising the nucleotide sequence of the human D4 dopamine receptor and 

30 sequences sufficient to direct the synthesis of the human D4 dopamine receptor 
protein in cultures of transformed eukaryotic cells. In preferred embodiments, the 
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recombinant expression construct is comprised of plasmid sequences derived from 
the plasmid pCD-PS and D4 dopamine receptor sequences corresponding to cDNA 
sequences for alleles D4.2, D4.4 and D4.7, as defined herein, as well as a hybrid < 
human D4 dopamine gene, comprised of the entirety of the genomic sequences 

5 from a particular D4 dopamine genomic clone described herein, up to a Pstl site 
located in exdn HI, followed by the remainder of the coding and 3' untranslated 
sequences found in a particular human cDNA sequence derived from a human 
neuroblastoma cell line. Recombinant expression constructs of the invention also 
encompass embodiments comprising allelic variations of the human D4 dopamine 

10 receptor genomic DNA sequences and cDNA-derived sequences. This invention 
includes recombinant expression constructs comprising essentially the nucleotide 
sequences of genomic and cDNA clones of the human D4 dopamine receptor and 
allelic variations thereof in embodiments that provide for the expression of human 
D4 dopamine receptor protein in cultures of transformed eukaryotic cells. 

15 it is also an object of this invention to provide cultures of transformed 

eukayotic cells that have been transformed with such recombinant expression 
constructs and that synthesize human D4 dopamine receptor protein. In a 
preferred embodiment, the invention provides monkey COS cells that synthesize 
human D4 dopamine receptor protein. 

20 The present invention also includes protein preparations of the human D4 

dopamine receptor, and preparations of membranes containing the human D4 
dopamine receptor, derived from cultures of eukaryotic cells transformed with the 
recombinant expression constructs of the invention. In a preferred embodiment, , 
cell membranes containing human D4 dopamine receptor protein are isolated from 

25 culture of COS-7 cells transformed with a recombinant expression construct that 
directs the synthesis of human D4 dopamine receptor. 

It also an object of this invention to provide the human D4 dopamine 
receptor for use in the in vitro screening of novel antipsychotic compounds. In 
a preferred embodiment, membrane preparations containing the human D4 

30 dopamine receptor, derived from cultures of eukaryotic cells transformed with the 
recombinant expression constructs of the invention, are used to determine the drug 
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dissociation properties of antipsychotic compounds in vitro. These properties are 
then used to characterize novel antipsychotic compounds by comparison to the 
binding properties of known antipsychotic compounds. 

The present invention will also be useful for the detection of dopamine and 
5 dopamine analogues, known or unknown, either naturally occurring or as the 
embodiments of antipsychotic or other drugs* 

It is an object of the present invention to provide a method for the 
quantitative detection of dopamine and dopamine analogues, either naturally 
occurring or as the embodiments of antipsychotic or other drugs. It is an 
10 additional object of the invention to provide a method to detect dopamine or 
dopamine analogues in blood, saliva, semen, cerebrospinal fluid, plasma, lymph, 
or any other bodily fluid. 

DESCRIPTION OF THE DRAWINGS 
IS Figure 1 illustrates the structure of a genomic clone comprising the human 

D4 dopamine receptor gene. 

Figure 2 illustrates the nucleotide sequence of genomic and cDNA clones 
of the human D4 dopamine receptor gene. 

Figure 3 provides an amino acid sequence alignment of mammalian 
20 dopamine receptors 

Figure 4 shows the binding of pH]spiperone to membranes of COS-7 cell 
transfected with a recombinant expression construct that expresses the human D4 
receptor protein. 

Figure 5 demonstrates the pharmacological specificity of pHJspiperone 
25 binding to COS-7 cells transfected with a human D4 receptor expression 
construct. 

Figure 6 illustrates the structure of a genomic clone comprising the human 
D4 dopamine receptor gene and the nucleic acid and corresponding amino acid 
sequences of 2, 4 and 7 copies of a novel 48 bp tandem repeat. 
30 Figure 7 illustrates restriction fragment length polymorphic variants of the 

human D4 receptor gene in 9 individuals. 
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Figure 8 demonstrates the transcriptional integrity of each of three cloned 
variant human D4 receptor gene expression constructs expressed in transfected 
COS-7 cells. f 

Figure 9 illustrates Scatchard analysis (panels a) and pEQ-spiperone 
5 competition binding experiments (panels b) of each of three cloned variant human - 
D4 receptor gene expression constructs expressed in transfected COS-7 cells. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The term "D4 dopamine receptor" as used herein refers to proteins 

10 substantially homologous to, and having substantially the same biological activity 
as, the protein coded for by the nucleotide sequences depicted in Figure 2 and 
Figure 6 (i.e., proteins which display high affinity binding to clozapine) [SEQ ID 
Nos: 1,3,4,5,7,8,10,12,14 & 15]. This definition is intended to encompass 
natural allelic variations in the D4 dopamine receptor sequence, specifically 

15 including the alleles D4.2, D4.4 and D4.7, as defined herein [SEQ ID Nbs.: 
17,19 & 21], and all references to the D4 dopamine receptor, and nucleotide and 
amino acid sequences thereof are intended to encompass such allelic variations, 
both naturally-occurring and man-made. Cloned genes of the present invention 
may code for D4 dopamine receptors of any species of origin, including, mouse, 

20 rat, rabbit, cat, and human, but preferably code for receptors of mammalian, most 
preferably human, origin. 

The production of proteins such as the D4 dopamine receptor from cloned 
genes by genetic engineering is well known (see, e.g. , U.S. Patent No. 4,761,371 
to Bell et al at Col. 6 line 3 to Col. 9 line 65; the disclosure of all U.S. patent 

25 references cited herein is to be incorporated herein by reference). The discussion 
which follows is accordingly intended as an overview of this field, and is not 
intended to reflect the full state of the art. 

DNA which encodes the D4 dopamine receptor may be obtained, in view 
of the instant disclosure, by chemical synthesis, by screening reverse transcripts 

30 of mRNA from appropriate tissues, cells or cell line cultures, by screening 
genomic libraries from appropriate cells, or by combinations of these procedures, 
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as illustrated below. Screening of mRNA or genomic DNA may be carried out 
with oligonucleotide probes generated from the D4 dopamine receptor gene 
sequence information provided herein. Probes may be labeled with a detectable 
group such as a fluorescent group, a radioactive atom or a chemiluminescent 
5 group in accordance with know procedures and used in conventional hybridization 
assays, as described in greater detail in the Examples below. In the alternative, 
D4 dopamine receptor gene sequences may be obtained by use of the polymerase 
chain reaction (PCR) procedure, with the PCR oligonucleotide primers being 
produced from the D4-dopamine receptor gene sequence provided herein (see 

10 U.S. Patent Nos. 4,683,195 to Mullis et al. and 4,683,202 to Mullis). 

The D4 dopamine receptor may be synthesized in host cells transformed 
with constructs containing DNA encoding the D4 dopamine receptor. Such 
constructs are replicable and are used herein either to amplify DNA encoding the 
D4 dopamine receptor and/or to express DNA which encodes the D4 dopamine 

IS receptor. An expression construct is a replicable DNA construct in which a DNA 
sequence encoding the D4 receptor is operably linked to suitable control sequences 
capable of effecting the expression of the D4 receptor in a suitable host. The 
need for such control sequences will vary depending upon the host selected and 
the transfection method chosen. Generally, control sequences include a 

20 transcriptional promoter, an optional operator sequence to control transcription, 
a sequence encoding suitable mRNA ribosomal binding sites, and sequences which 
control the termination of transcription and translation. When used for DNA 
amplification such constructs do not require expression control domains. All that 
is needed is the ability to replicate in a host, usually conferred by an origin of 

25 replication, and a selective marker gene to facilitate recognition of transformants. 

Constructs useful for practicing the present invention include plasmids, 
viruses (including phage), retroviruses, and integratable DNA fragments (i.e., 
fragments integratable into the host genome by homologous recombination). The 
construct may replicate and function independently of the host genome, or may * 

30 in some instances, integrate into the host genome itself. Suitable constructs will 
contain replicon and control sequences which are derived from species compatible 
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with the intended expression host. Transformed host cells are cells which have 
been transformed, transfected or infected with the D4 receptor-containing 
constructs assembled using recombinant DNA techniques. Transformed host cells 
ordinarily express the D4 receptor, but host cells transformed for purposes of 
5 cloning or amplifying the D4 receptor DNA need not express the D4 receptor. 
When expressed, the D4 receptor will typically be located in the host cell 
membrane. 

DNA regions are operably linked when they are functionally related to 
each other. For example: a promoter is operably linked to a coding sequence if 

10 it controls the transcription of the sequence; a ribosome binding site is operably 
linked to a coding sequence if it is positioned so as to permit translation. 
Generally, operably linked means contiguous and, in the case of leaders 
sequences, contiguous and in the same translation^ reading frame. 

Cultures of cells derived from multicellular organisms are a desirable host 

15 for recombinant D4 dopamine receptor synthesis. In principal, any higher 
eukaryotic cell culture can be used, whether from vertebrate or invertebrate 
culture. However, mammalian cells are preferred, as illustrated in the Examples. 
Propagation of such cells in cell culture has become a routine procedure (see 
Tissue Culture . Academic Press: New York (Kruse & Patterson, eds.) 1973). 

20 Examples of useful host cell lines are VERO and HeLa cells, Chinese hamster 
ovary (CHO) cell lines, and WI138, BHK, COS-7, CV, and MDCK cell lines. 
Expression constructs for such cells ordinarily include (if necessary) an origin of 
replication, a promoter located upstream from the gene to be expressed, along 
with a ribosome binding site, RNA splice site (if intron-containing genomic DNA 

25 is used), a polyadenylation site, and a transcriptional termination sequence. 

The transcriptional and translational control sequences in expression 
constructs to be used in transforming vertebrate cells are often provided by viral 
sources. For example, commonly used promoters are derived from polyoma, 
Adenovirus 2, and Simian Virus 40 (SV40; see, e.g., U.S. Patent No. 

30 4,599,308). The early and late promoters of SV40 are useful because both are 
obtained easily from the virus within a fragment which also contains the SV40 
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viral origin of replication (see Fiers et al, 1978, Nature 223: 1 13). Further, the 
human genomic D4 receptor promoter, control and/or signal sequences, may also 
be used, provided such control sequences are compatible with the host cell 
chosen. 

5 An origin of replication may be provided either within the construct itself, 

such as may be derived from SV40 or other viral source (e.g., Polyoma, 
Adenovirus, VSV, or MPV), or may be provided by the host cell chromosomal 
replication mechanism. If the construct is integrated into the host cell 
chromosome, the latter may be sufficient. 

10 D4 dopamine receptors made from cloned genes in accordance with the 

present invention may be used for screening compounds for D4 dopamine receptor 
activity, or for determining the amount of a dopaminergic drug in a solution (e.g. , 
blood plasma or serum). For example, host cells may be transformed with a 
construct of the present invention, D4 dopamine receptors expressed in that host, 

15 the cells lysed, and the membranes from those cells used to screen compounds for 
D4 dopamine receptor binding activity . Competitive binding assays in which such 
procedures may be carried out are well known, as illustrated by the Examples 
below. By selection of host cells which do not ordinarily express a dopamine 
receptor, pure preparations of membranes containing D4 receptors can be 

20 obtained. Further, D4 dopamine receptor agonist and antagonists can be 
identified by transforming host cells with constructs of the present invention. 
Membranes obtained from such cells can be used in binding studies wherein the 
drug dissociation constants are measured. Such cells must contain D4 protein in 
the plasma and other cell membranes. Procedures for carrying out assays such 

25 as these are also described in greater detail in the Examples which follow. 

Cloned genes and constructs of the present invention are useful to 
transform cells which do not ordinarily express the D4 dopamine receptor to 
thereafter express this receptor. Such cells are useful as intermediates for making 
cell membrane preparations for receptor binding assays, which are in turn useful 

30 for drug screening. Further, genes and constructs of the present invention are 
useful in gene therapy. For such purposes, retroviral constructs as described in 
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U.S. Patent No. 4,650,764 to Temin and Watanabe or U.S. Patent No. 4,861,719 
to Miller may be employed. Cloned genes of the present invention, or fragments 
thereof, may also be used in gene therapy carried out homologous recombination 
or site-directed mutagenesis (See generally Thomas & Capecchi, 1987, Cell £1: 

5 503-512; Bertling, 1987, Bioscience Reports 2: 107112; Smithies et al. 9 1985, 
Nature 212: 230-234). 

Cloned genes of the present invention, and oligonucleotides derived 
therefrom, are useful for screening for restriction fragment length polymorphism 
(RFLP) associated with genetic polymorphisms within a population. SuchRFLPs 

10 may also be associated with certain genetic disorders, and the probes provided by 
the invention can be used for their identification and the identification of 
individuals susceptible to neuropsychiatry disorders such as schizophrenia and 
manic depression. Such RFLPs may also be useful for predicting individual 
responsiveness to psychotropic and antipsychotic drugs. 

15 Oligonucleotides of the present invention are useful as diagnostic tools for 

probing D4 receptor gene expression in nervous tissue. For example, tissue can 
be probed in situ with oligonucleotide probes carrying detectable label groups by 
conventional autoradiography techniques, as explained in greater detail in the 
Examples below, to investigate native expression of this receptor or pathological 

20 conditions relating thereto. Further, chromosomes can be probed to investigate 
the location of the D4 dopamine receptor gene, and potential pathological 
conditions related thereto, as also illustrated by the Examples below. 

Oligonucleotides of the present invention are also useful for in vitro 
amplification of D4 dopamine receptor sequences. Amplification methods include 

25 but are not intended to be limited to the polymerase chain reaction and the ligase 
chain reaction. Amplification of D4 dopamine receptor sequences is useful as a 
diagnostic tools for analyzing and quantitating D4 receptor gene expression in 
tissue, for example nervous tissue. Additionally, the use of oligonucleotides 
synthesized or isolated according to methods well known in the art that comprise 

30 D4 dopamine receptor sequences provided by the invention permit in vitro 
amplification methods to be used for the detection of D4 dopamine receptor alleles 
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comprising the genotype of somatic and germ cells, zygotes, embryoes, and 
tissues in humans and other mammals for diagnostic, therapeutic and other 
purposes. 

The Examples which follow are illustrative of specific embodiments of the 
5 invention, and various uses thereof. They are set forth for explanatory purposes 
only, and are not to be taken as limiting the invention. 

EXAMPLE 1 

Screening Tissue and Cell Line RNA for 
10 Dopamine Receptor Expression 

RNA was prepared from different rat tissues or cell lines using the 

guadinium thiocyanate/CsCl procedure described in Bunzow et aL, 1988, Nature 

336 : 783-787. Tissues tested included heart, epididymis, testis, gut, pancreas, 

spleen, thymus, muscle, ventricle, atria, lung, adrenal, kidney, liver, pineal gland 

15 and pituitary. Cell lines screened included SK-N-MC, SK-N-SH, COS, AKR1, 
LUr, GH4C1, NG108-15, AtT20, 3T3, BSC40, C6, CV-1, Hela, IMR-32, 
N4TG1, NCB-20, PC-12, Rin m5f and WERI-Rb-1. 20 fig of RNA was analyzed 
by Northern blot hybridization with a radiolabeled BstYl-BgKl DNA fragment of 
the rat D2 receptor, which encodes the putative transmembrane domains VI and 

20 VII. Blots were hybridized under standard conditions as described in Bunzow et 
a/., ibid.; hybridization was performed overnight at 37°C. Blots were then 
washed at 55 °C in 2X standard saline-citrate (SSC) and 1% sodium dodecyl 
sulfate (SDS). Washed blotes were exposeed to X-ray film for two days at -70°C 
using an intensifying screen. For comparison, the same blot was hybridized under 

25 high stringency conditions (the modifications of which include using 50% 
formamide and 42°C for the hybridication and 0.2X SSC for the wash). Under 
conditions of low stringency the SK-N-MC cell line showed a positive signal in 
these experiments. 

30 EXAMPLE 2 

Construction of a cDNA Phage Library using Neuroblastoma RNA 
Double-stranded cDNA was synthesized using standard techniques [see 
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Sambrook et aL, 1989, Molecular Clo ning: A Laboratory Manual. 2d ed., Cold 
Spring Harbor Laboratory Press: New York] from poly(A) + mRNA isolated from 
the human neuroblastoma cell line SK-N-MC as described in Example 1. The 
cDNA was directionally cloned into the EcoBl and Xhol restriction endonuclease 

5 sites of the phage cloning vector lambda ZAPII (Stratagene, La Jolla, CA). The 
library was transferred to colony plaque screen filters (New England Nuclear, 
Boston, MA). Approximately 500,000 independent clones were screened under 
low-stringency hybridzation conditions as described in Example 1 . Hybridization 
was performed for 30 hrs with ^-labeled 1 .6 kb BomHI - Bgltt and 300 bp BstYI 

10 - BgKL fragments of a rat D2 receptor clone at a specific activity of ltf dpm/^g. 
Filters were washed at 55°C in 2X SSC and 1% SDS. The clone D210S was 
isolated and sequenced using the Sanger dideoxy chain termination method 
catalyzed by Sequenase (U.S. Biochemical Corporation, Cleveland, OH). The 
sequence of this clone is shown in Figure 2 (hatched area). 

15 The putative coding sequence is shown in capitals (non-coding sequence 

is in italics) and the deduced amino acid sequence is shown above the nucleotide 
sequence. Numbering of the putative coding sequence begins with the first 
methionine of the open reading frame. The sequence corresponding to the cDNA 
clone is hatched. Single-letter abbreviations for amino acids and nucleotides used 

20 herein can be found in G. Zubay, Biochemistry (2d. ed.), 1988 (MacMillen 
Publishing: New York) p.33. Noteworthy is the presence of a duplicated 48 bp 
sequence in the putative third exon, corresponding to the third cytoplamsic loop 
region of the D4 receptor protein. The complete nucleotide sequence of this clone 
has been determined (see Figure 6, wherein these repeated sequences of this clone 

25 are designated D4.2 [SEQ ID No: 17]). 

EXAMPLE 3 

Screening a Genomic DNA Phage Library 
with a Huma n Dopamine Receptor Probe 

30 Clone D210S was ^P-labeled by random primed synthesis and used to 

screen a commercially available human genomic library cloned in the phage 

vector EMBL3 (Clonetech, Palo Alto, CA), Hybridization was performed as 
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described in Example 2 using 50% formamide. After hybridization the filters 
were washed at 65°C in 0.1X SSC and 0.1% SDS. The clone D210G was 
isolated and analyzed by restriction endonuclease and Southern blot analysis. The 
map of this genomic clone is shown in Figure 1, wherein the structure of the D4 

, 5 receptor gene is compared with the structure of the D2 gene. Relevant restriction 
endonuclease sites in the D4 receptor sequence are indicated. The Sail site is part 
of the cloning site in EMBL3. The proposed coding regions are boxed and 
numbered in Roman numerals* Perfect matches of proposed intron/exon junction 
sites are indicated by connecting stippled ban between the receptor clones. 

10 Pstl - PstI fragments of approximately 1.3 kb and 2.6 kb, and an 

overlapping Sail - EcoRI fragment of approximately 2.0 kb derived from the D4 
receptor gene were subcloned into the plasmid pBluescript-SK (Stratagene). The 
subcloned fragments were characterized by sequence analysis as described above. 
This sequence is shown in Figure 2. The complete nucleotide sequence of this 

15 clone has been determined (see Figure 6, wherein these repeated sequences of this 
clone are designated D4.7 [SEQ ID No: 21]). 

EXAMPLE 4 

DNA Sequence Analysis of the Huma n D4 Dopamine Receptor 
20 One of the cDNA clones detected by screening the SK-N-MC 

neuroblastoma library with a rat D2 probe at low stringency (D210S) contained 
a 780 bp EeoBl-XhoI insert which hybridized to the rat probe. Screening of a 
human genomic EMBL3 library (Clontech) under high stringency conditions with 
the clone D210S resulted in the isolation of the genomic clone D210G. 
25 Southern blot and sequence analysis indicated that the clone contained a 

5 kb Sall-Pstl fragment which coded for the entire gene of D210S [SEQ ID No.: 
21], Sequence analysis of this insert showed the presence of an open reading 
frame with homology to the amino acid sequence of transmembrane domains V 
(45%), VI (46%) and VII (78%) of the D2 receptor, shown in Figure 3. The 
30 putative amino acid sequence of the human D4 receptor [SEQ ID No.: 22] is 
aligned with the human and rat D2 f rat D3 and human and rat Dl receptor 
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sequences. Amino acids conserved within the group of dopamine receptors are 
shaded. The putative transmembrane domains are overlined and labeled by 
Roman numerals. 

There is a potential translation initiation codon (ATG) 590 bp downstream 

5 from the SaR site, followed by an open reading frame that showed amino acid 
sequence homology with transmembrane domain I (36%) and II (63%) of the D2 
receptor. Almost immediately downstream from the transmembrane domain n 
sequence, homology to the D2 receptor disappears, indicating the presence of an 
intron in the genomic DNA. This intron spanned approximately 2 kb, after which 

10 sequence homology to the D2 receptor was re-established. Translation of the 
putative gene product showed homology to the transmembrane domains in (68%), 
IV (37%), V(46%) and Vn (78%) of the D2 receptor (see Figure 3). 

Potential splice junction donor and acceptor sites (Mount, 1982, Nucl. 
Acids Res. lfl: 461-472) were found in the transmembrane domains II, in and VI, 

15 as shown in Figure 1. These splice sites were at an identical position as in the 
D2 and D3 receptor gene [see Grandy et al. t 1989, Proc, Natl. Acad. Sci. USA 
Sfi: 9762-9766; Dal Toso et al. r 1989, EMBO J. ft 4025-4034; Sokoloff et o/ M 
1990, Nature ML 146-151] and Figure 1. The coding sequence downstream 
from transmembrane domain IV is identical to the sequence of clone D210S but 

20 is interrupted by an intron of about 300 bp between transmembrane domains V 
and VI and an additional intron of 92 bp in transmembrane VI (Figure 1, hatched 
area). The precise location of the splice site for the intron between 
transmembrane V and VI cannot be determined due to the fact that a sequence of 
52 bp present in the coding sequence is repeated exactly on either side of the 

25 intron (Figure 2). 

The deduced amino acid sequence from the genomic and cDNA nucleotide 
sequences indicated that this gene codes for a protein of 387 amino acids with an 
apparent molecular weight of 41kD. A hydrophobicity plot of the protein 
sequence suggests the existence of seven transmembrane domains. These regions 

30 correlate with the observed homologous regions in the human D2 receptor and 
other receptors belonging to the family of G-protein coupled receptors (Dohlman 



-18- 



WO 94/03602 



PCTAJS93/07370 



et o/ M 1987, Biochemistry 2& 2657-2664; Bunzow et a/., 1988, Nature 226: 783- 
787; Sokoloff et al., 1990, Nature 242: 146-151; and Figure 2). A potential N- 
linked glycosylation site (Hubbard & Ivatt, 1981, Ann. Rev. Biochem. 5Q: 555- 
583) is located two amino acids downstream from the initiation methionine. The 
5 amino acid residues Asp (80) and Asp (115) in the D4 receptor, which are 
conserved within the family catecholaminergic receptors, are postulated to act as 
"counterfoils" in catecholamine binding (Strader et al. , 1988, J. Biol. Chem. 262: 
10267-10271). Also conserved within the family of catecholaminergic receptors 
are Ser (197) and Ser (700) which have been suggested to interact with the 

10 catechol hydroxyl groups (Kozak, 1984, Nucleic Acids Res. 12: 857-872). 
Several consensus sites for potential phosphorylation by protein kinase C and 
protein kinase A are found in the third cytoplasmic loop (Sibley et al., 1987, Cell 
48: 913-922; Bouvier et al. f 1988, Nature 222: 370-373). The Cys (187), which 
may serve as a substrate for palmitoylation, is conserved in most of the G-protein 

15 coupled receptors (O'Dowd et a/., 1989, J. Biol. Chem 2M- 7564-7569). The 
short carboxyl tail, which terminates similar to the D2 and D3 receptor at Cys 
(387) (Bunzow et al. f 1988, Nature 22& 783-787; Grandy et al., 1989, Proc. 
Natl. Acad. ScL USA Sfi: 9762-9766; Dal Toso et al., 1989, EMBO J. 8: 4025- 
4034; Sokoloff et a/., 1990, Nature 242: 146-151), and the relatively large third 

20 cytoplasmic loop, are features observed in most receptors which interact with an 
isoform of the G protein. 

A noteworthy feature of the sequence of the third exon of the genomic D4 
receptor clone is the presence of a 7-fold repeat of a GC rich, 48 bp sequence, 
beginning at nucleotide 447 of exon in, and encodes a proline-rich portion of the 

25 D4 dopamine receptor protein (see Figure 6, wherein these sequences of this 
clone are designated D4.7 [SEQ ID No.:21]). This region of the protein 
corresponds to the putative third cytoplasmic loop of the receptor protein molecule 
[SEQ ID No.: 22]. This sequence corresponds to the 2-fold repeat of a 
homologous sequence found in the SK-N-MC neuroblastoma cDNA sequence 

30 described in Example 2, suggesting that the D4 receptor gene may be 
polymorphic. This sequence is uniquely found in the D4 receptor and is not 
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homologous to any other known dopamine receptor protein. Interestingly, this 
region of the human D4 receptor is not found in the rat homologue of the D4 
receptor, making this variation specific to humans. 

From these results we have concluded that the sequences we have isolated 
5 encode a polymorphic member of the dopamine receptor family. 

EXAMPLES 

Construction of an Mammalian DNA Expression 
Construct using Dopamine Receptor cDNA 

10 The Apal-Pstl gene fragment (Figure 1, the Pstl site found in exon m 

after transmembrane domain V) was ligated to the corresponding Pstl-EcoBl 
cDNA fragment isolated from the SK-N-MC cDNA. This construct was then 
cloned into the vector pCD-PS (Bonner et a/., 1988, Neuron I: 403-410). This 
vector allows for the expression of the human D4 receptor gene fom the SV40 

15 promoter. Large quantities of the pCD-PS-D4 construct plasmid were prepared 
using standard techniques (see, Sambrook et o/ M ibid.). This plasmid was 
transfected into COS-7 ceils by the calcium phosphate precipitation technique 
(Gorman etal., 1983, Science 221: 551-553). Two days later membranes cells 
were harvested and analyzed as described in Example 6. 

20 

EXAMPLE 6 

Analysis of Dopamine and Dopamine-Antagonist 
Binding of D4 Dopamine Receptor 

Cells were harvested and homogenized using a teflon pestle in 50 mM 

25 Tris-HCl (pH 7.4 at 4°C) buffer containing 5 mM EDTA, 1.5 mM CaCl 2> 5 mM 

MgCl 2 , 5 mM KCl and 120 mM NaCl. Homogenates were centrifuged for 15 

minutes at 39,0O0g, and the resulting pellets resuspehded in buffer at a 

concentration of 150-250 /xg/ml. For saturation experiments, 0,25 ml aliquots of 

each tissue homogenate was incubated in duplicate with increasing concentrations 

30 of pH]spiperone (70.3 Ci/mmol; 10-3000 pM final concentration) for 120 min at 

22°C in a total volume of 1 ml. The results of these experiments are shown in 

Figure 4. The results shown are representative of two independent experiments 
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each conducted in duplicate (the inset show a Scatcherd plot of the same data). 
Estimated (approximately 260 fmol/mg protein) and K, (70 pM) values were 
obtained by LIGAND computer program. 

Representative curves are shown in Figure 5 for the concentration 
5 dependent inhibition of pHJspiperone binding by various dopaminergic agonist 
and antagonists. Estimated K, values are listed in Table I along with the *s values 
obtained on the human D2 receptor expressed in GH(4)ZR(7) cells. For 
competition binding experiments, assays were initiated by the addition of 0.25 ml 
of membrane preparation and incubated in duplicate with the concentrations of 

10 competing ligands indicated in Figure 5 (10 -14 to 10* M) and fHJspiperone (150- 
300 pM) for 120 min at 22°C. Assays were terminated by rapid filtration through 
a Titertek cell harvester and filters subsequently monitored to quantitate 
radioactive tritium. For all experiments, specific pHJspiperone binding was 
defined as that binding inhibited by 10 pM (+)sulpiride. Both saturation and 

IS competition binding data were analyzed by the non-linear least square curve-fitting 
program LIGAND run on a Digital Micro-PDP-IL The human D4 dopamine 
receptor displays the following; pharmacological profile of inhibition of 
pHlspiperone binding in this assay: spiperone > eticlopride > clozapine > <+)- 
butaclamol > raclopride > SCH23390. 

20 

EXAMPLE 7 

Polymorphic Allelic Variants of the D4 Dopamine Receptor 
Isolated from Human Tissue cDNA Libraries 

25 Human cDNA libraries were screened for expression of polymorphic 

variants of the human D4 receptor gene. A human substantia nigra cDNA library 
construced in lambda gtll (Clontech) and a pituitary cDNA library constructed 
in lambda gtlO as described in Example 2 were screened for clones encoding the 
D4 receptor. Approximately 0.5-1 x 10 6 plaque-forming units (p.f.u.) were 

30 transferred in duplicate to nylon filters puPont/NEN) and probed with a 
^P-labeled 700 bp EcoRl-Xhol fragment encoding the cDNA isolated from the 
neuroepithelioma SK-N-MC under conditions as described in Example 2 above. 
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Screening of cDNA libraries from human pituitary and substantia nigra 
resulted in the isolation of variant cDNA clones of the D4 receptor. The pituitary 
lambda gtlO clone contained a 1.4-kb EcoRl insert, coding for intron 1 and the 
down-stream sequences of the D4 receptor. This pituitary D4 receptor clone also 

5 contained the second intron, but the last intron was spliced out* The isolated 
substantia nigra lambda gtll clone contained a 600-bp EcoBI insert, coding for 
the D4 receptor, starting in the 5' site of the putative third cytoplasmic loop. 
Both these clones contained a four-fold repeat (see Figure 6, wherein these 
sequences of these clones are designated D4.4 [SEQ ID No.; 19]) of the 48-bp 

10 sequence previously found as a 7-fold repeat in the D4 genomic clone D210G 
(Example 4) and a 2-fold repeat in the neuroblastoma SK-N-MC cDNA clone 
(Example 2) within the putative third cytoplasmic loop of the D4 receptor protein 
(compare, SEQ ID Nos.: 18, 20 & 22]. A comparison of the nucleic acid 
sequences revealed that, due to the absence of conventional splice junction sites 

15 in the seven-fold repeat sequence of the genomic clone, a novel splicing 
mechanism would be required to account for the existence of the different cDNA 
clones. 

Two different human genomic libraries from different human individuals 
(Clontech) were screened to detect allelic polymorphism in the human D4 receptor 

20 gene. Screening of genomic libraries resulted in the isolation of a genomic clone 
with a 4-fold repeat of the 48 bp sequence previously detected in pituitary and 
substantia nigra cDNA. This result indicated that the polymorphic cDNA 
molecules resulted from genetic polymorphic variation in the corresponding 
genomic DNA, due to the existence of polymorphic alleles in the human 

25 population for the D4 receptor. 

EXAMPLE 8 

Additional D4 Receptor Gene Allelic Variants 
30 Found bv RFLP Analysis of Human Genomic DNA 

The three different D4 receptor sequences predict a restriction fragment 

length polymorphism for a Hincll-Pstl fragment of the D4 gene (Figure 6). 
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Southern blot analysis of human genomic DNA was performed as described {see 
Sambrook et al. , ibid, and Example 3). A RFLP was observed in humans and the 
different allelic fragments were sized. 

Briefly, high molecular weight genomic DNA was isolated from human 
5 blood samples using proteinase K and phenol/chloroform extractions. Genomic 
DNA (5 fig) was digested with the restriction endonucleases HincU and Pstl and 
size separated by agarose (1%) gel electrophoresis. DNA was transferred to 
nylon membranes (Zeta-probe, Biorad) according to standard techniques 
(Sambrook et al., ibid.). Southern blots were probed with a 32 P-labeled 600 bp 

10 EcoKL-HincTl fragment, coding for the D4 cDNA isolated from the 
neuroepithelioma SK-N-MC, and washed at high stringency (65 °C, O.lxSSC, 
0. 1 % SDS, 40 min). The blot was exposed to X-ray film for three days. Results 
of these experiments are shown in Figure 7. 

The position of a 540-bp size marker is indicated on the left. D4- 

15 hybridizing polymorphic bands can be sera at approximately 520 bp, 620 bp, 710 
bp, 760 bp and 800 bp. [It will be recognized to those with skill in this art that 
the sizes given herein for the alleles of the human D4 dopamine receptor gene are 
limited in their precision to the resolving power of the agarose gels used in the 
analyses. The sizes are approximate as given herein, and more exact sizes can 

20 be calculated from the sequences of the different alleles found in SEQ ID Nos: 
17, 19 & 21.] The 520 bp, 620 bp and 760 bp fragments correlate closely with 
the sizes of the HincU-Pstl fragments of the cloned D4 receptor variants with the 
two-, four- and seven-fold repeat sequences respectively. The presence of 710 
bp and 800 bp fragments suggests that variants with six-fold and eight-fold repeat 

25 sequences also exist. Additional polulation screening experiments have resulted 
in the detection of alleles corresponding to three-fold and five-fold repeats. A 
total of 7 alleles of the D4 receptor gene have accordingly been found in the 
human population. 
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EXAMPLE 9 
Expression of Allelic Variants of the D4 Receptor 
Mammalian DNA expression constructs were made as described in 
Example 5 for expression of the allelic variants of the D4 receptor. Various 
5 cDNA constructs were cloned into the expression vector pCD-PS (see Example 
5) which contains the SV40 origin of replication and drives expression of the 
cloned inserts from the SV40 late promoter. A 1.7-kb KpriL-Xbal fragment 
comprising a cDNA for the D4 receptor gene containing the 7-fold repeat was 
cloned into the pCD-PS vector of Example 5 and called hereafter pCD-D4.7. 

10 Full-length cDNA clones for the D4.2 and D4.4 forms of the receptor were made 
by in vitro recombination between partial cDNA clones of these forms with the 
full-length cDNA clone of the D4.7 receptor variant. The clone pCD-D4.4 was 
created by substituting the 920-bp Pstl-EeoRl 3 f fragment of pCD-D4.7 with the 
730-bp Pstl-EcoXJ fragment of the D4 cDNA isolated from human pituitary. In 

15 a similar fashion the clone pCD-D4.2 was constructed by exchange of this 3' 
Pstl-EcdRl fragment of pCD-D4.7 with a 630-bp Pstl-EcoRl fragment of the D4.2 
cDNA clone isolated from the neuroepithelioma SK-N-MC. 

v Transient expression in COS-7 cells was achieved as follows. Cells 
harvested and washed in phosphate buffered saline (PBS). 5xl0 7 cells were 

20 resiispended in 1 ml PBS with 100 /xg/ml plasmid DNA (purified by caesium 
chloride gradient centrifugation) and incubated for 10 min on ice. Next, 400 ^1 
aliquots of the cell suspension were subjected to an electric field of 0.65 kV/cm, 
4. 1 ms pulse duration using a BTX 600 Electro Cell Manipulator (Biotechnologies 
& Experimental Research, Inc., San Diego, CA). After the electric pulse, the 

25 cells were incubated for another 10 min on ice and then seeded in Modified 
Eagle's Medium supplemented with 10% fetal calf serum. The next day the 
medium was renewed. Three days after electroporation the cells were harvested 
and stored at -80°C until use in receptor binding studies as described herein 

Expression of each of the cloned variant D4 receptor constructs was 

30 demonstrated by Northern blot analysis as described in Example 1. Blots were 
hybridized with the 700 bp EcoKl-XhoI fragment of the D4 cDNA isolated from 
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the neuroepithelioma SK-N-MC (Example 2). The results of these experiments 
are shown in Figure 8. Transient expression of the three forms in COS-7 cells 
as characterized in these experiments demonstrated the expected size and size 
differences between the three forms, indicating that none of the expressed D4 
5 receptor RNAs are further processed or produced from one another by RNA 
splicing events. Furthermore, the two bands observed for the D4.2 and D4.4 
clones represent the consequence of the use of either the endogenous D4 receptor 
polyadenylation signal or the SV40 (vector-derived) polyadenylation signal). 
These observations indicate that in the transient expression system the expression 
10 of the three different clones would result in the formation of three structurally 
different receptors. 

EXAMPLE 10 

Analysis of Dopamine and Dopamine- Antagonist 
IS Binding of Variant D4 Dopamine Receptors 

Pharmacological analysis of dopamine agonist and antagonist binding was 

performed as described in Example 6. The results of these experiments are 

shown in Figure 9. Panels (a) illustrate Scatchard analysis of the saturation 

isotherms for pHJspiperone binding to membranes prepared from COS-7 cells 

20 transiently transfected with pCD-D4.2 (D4.2), pCD-D4.4 (DA A) and pCD-D4.7 
(D4.7). Panels (b) show clozapine competition of [ 3 H]spiperone binding for the 
three allelic forms of the D4 receptor in the presence (+Na + ) and absence (-Na + ) 
of sodium chloride. 

Pharmacological analysis demonstrated that all three variants displayed 

23 saturable [^spiperone binding (300-1000 fmol mg' 1 ) with similar dissociation 
constants in the absence of sodium chloride (Kj = 40-50 pM; Figure 4a). 
However, in the presence of 120 mM sodium chloride, the dissociation constants 
increased approximately two- to three-fold for D4.2 and D4.4 but not for D4.7. 
Clozapine competition of [ 3 H]spiperone binding revealed that D4.2 and 

30 D4.4 had lower dissociation constants for clozapine in the absence of sodium 
chloride (K~ 3nM without sodiumn chloride; Kj= 23nM with sodium chloride). 
D4.7 had a dissociation constant of approximately IS nM for clozapine which did 
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not exhibit sodium chloride sensitivity (K<= 12nM without sodium chloride; Kj= 
18nM with sodium chloride; shown in Figure 4b). This sodium chloride-mediated 
effect for clozapine on the D4 variants was not modulated by guanine nucleotides. 
Agonists and antagonists (dopamine, bromocriptine, raclopride and 

5 clozapine) inhibited [ 3 H]spiperone binding (in the presence of sodium chloride) to 
these different D4 receptor variants in a concentration-dependent manner with 
similar dissociations constants. Furthermore, all three variants exhibited a 
guanine nucleotide-sensitive high-affinity form of the receptor upon competition 
with dopamine, suggesting that all these variants can functionally couple to 

10 G-proteins. Thus, we have defined a novel, polymorphic dopamine receptor 
which we term D4. 

It should be understood that the foregoing disclosure emphasizes certain 
specific embodiments of the invention and that all modifications or alternatives 
equivalent thereto are within the spirit and scope of the invention as set forth in 

15 the appended claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION ; 



(i) APPLICANT: 

(A) NAMEs State of Oregon 

(B) STREETS Oregon Health Sciences Univ., 3181 S.W. Sam 

Jackson Park Road 

(C) CITY: Portland 

(D) STATE: Oregon 

(E) COUNTRY l USA 

(F) POSTAL CODE (ZIP)s 97201-3098 

(G) TELEPHONE: 503-494-8200 

(H) TELEFAX: (S03)-494-4729 

(ii) TITLE OF INVENTION: A Novel Human Dopamine Receptor and Uses 
(iii) NUMBER OF SEQUENCES: 22 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US93/ 

(2) INFORMATION FOR SBQ ID NO:l: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: 5'UTR 

(B) LOCATION: 1..103 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 104 ..388 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104.. 388 

(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Van Tol r Hubert H.M. 

Wu, Caren M. 
Guan, Hong-Chang 
Ohara, Koichi 
Bunzow, James R. 
• Civelli, Olivier 
Kennedy, James 
Seeman, Phillip 
Nixnik, Hyman B. 
Jovanovic, Vera 

(B) TITLE: Multiple dopamine D4 receptor variants in the 

human population T 

(C) JOURNAL: Natur 

(D) VOLUME: 358 
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(F) PAGES: 149-152 

(G) DATE: 9 JULY-1992 

( x ) PUBLICATION INFORMATION : 

(A) AUTHORS i Van Tol, Hubert H.M« 

Bunzow, James R. 
Guan, Hong-Chang 
Sunahara, Roger K» 
Seeuan, Phillip 
Niznik, Hyman B. 
Civelli, Olivier 

(B) TITLE t Cloning of the gene for a human dopamine 04 

receptor with high affinity for the antipsychotic 
clozapine 

(C) JOURNALS Nature 
(0) VOLUME: 350 

(F) PAGES t 610*614 

(G) DATE: 18 April-1991 

(K) RELEVANT RESIDUES IN SEQ ID N0:1: FROM 1 TO 388 

(xl) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

CGGGGGCGGG ACCAGGGTCC GGCCGGGGCG TGCCCCCGGG GAGGGACTCC CCGGCTTGCC 60 

CCCCGGCGTT GTCCGCGGTG CTCAGCGCCC GCCCGGGCGC GCC ATG GGG AAC CGC 115 

Met Gly Asn Arg 
1 

AGC ACC GCG GAC GCG GAC GGG CTG CTG GCT GGG CGC GGG CGG GCC GCG 163 
Ser Thr Ala Asp Ala Asp Gly Leu Leu Ala Gly Arg Gly Arg Ala Ala 
5 10 15 20 

GGG GCA TCT GCG GGG GCA TCT GCG GGG CTG GCT GGG CAG GGC GCG GCG 211 
Gly Ala Ser Ala Gly Ala Ser Ala Gly Leu Ala Gly Gin Gly Ala Ala 
25 30 35 

GCG CTG GTG GGG GGC GTG CTG CTC ATC GGC GCG GTG CTC GCG GGG AAC 259 
Ala Leu Val Gly Gly Val Leu Leu lie Gly Ala Val Leu Ala Gly Asn 
40 45 50 

TCG CTC GTG TGC GTG AGC GTG GCC ACC GAG CGC GCC CTG CAG ACG CCC 307 
Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala Leu Gin Thr Pro 
55 60 65 

ACC AAC TCC TTC ATC GTG AGC CTG GCG GCC GCC GAC CTC CTC CTC GCT 355 
Thr Asn Ser Phe He Val Ser Leu Ala Ala Ala Asp Leu Leu Leu Ala 
70 75 80 

CTC CTG GTG CTG CCG CTC TTC GTC TAC TCC GAG 388 
Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu 
85 90 95 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Gly Asn Arg Ser Thr Ala Asp Ala Asp Gly Leu Leu Ala Gly Arg 
1 5 10 15 

Gly Arg Ala Ala Gly Ala Ser Ala Gly Ala Ser Ala Gly Leu Ala Gly 
20 25 30 

Gin Gly Ala Ala Ala Leu Val Gly Gly Val Leu Leu He Gly Ala Val 
35 40 45 

Leu Ala Gly Aen Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala 
50 55 60 

Leu Gin Thr Pro Thr Asn Ser Phe He Val Ser Leu Ala Ala Ala Asp 
65 70 75 80 

Leu Leu Leu Ala Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu 

85 .90 95 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 1..20 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /partial 

/cons splice- (5'site: YES, 3 'site: NO) 
/evidence* EXPERIMENTAL 
/labels intronl 

/note» -This is the 5' sequence of an intron 
estimated to be 2.0 kilobases in length** 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

GTGAGCCGCG TCCGGCCGCA 20 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 1.-20 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /partial 

/cons splice- (5'site: NO, 3 'site: YES) 
/evidences EXPERIMENTAL 
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/labels intronl 

/note* "This is the 3' sequence of a intron 
estimated to be 2.0 kilobases in length.*' 

(xi) SEQUENCE DESCRIPTION: . SEQ ID NO:4t 
CCTGTGGTGT CGCCGCGCAG 20 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH t 113 base pairs 

(B) TYPE i nucleic acid 

(C) STRANDEDNESSi single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 1..113 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
<B) LOCATION: 1..113 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTC CAG GGT GCC GCG TGG CTG CTG ACC CCC CGC CTG TGC GAC GCC CTC 48 
Val Gin Gly Gly Ala Trp Leu Leu Ser Pro Arg Leu Cys Asp Ala Leu 
1 5 10 15 

ATG GCC ATG GAC GTC ATG CTG TGC ACC GCC TCC ATC TTC AAC CTG TGC 96 
Met Ala Met Asp Val Met Leu Cys Thr Ala Ser He Phe Asn Leu Cys 
20 25 30 

GCC ATC AGC GTG GAC AG 113 
Ala He Ser Val Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Val Gin Gly Gly Ala Trp Leu Leu Ser Pro Arg Leu Cys Asp Ala Leu 
1 5 10 15 

Met Ala Met Asp Val Met Leu Cys Thr Ala Ser He Phe Asn Leu Cys 
20 25 30 

Ala He Ser Val Asp 
35 



- 30 - 



WO 94/03602 



PCT/US93/07370 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 
(0) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 1..102 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence" EXPERIMENTAL 

/labels intronll 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GTGCGCCGCC CTCCCCGCCC GCGCCCCGGC GCCCCCGCGC CCCGCCCGCC GCCCTCACCG 60 
CGGCCTGTGC GCTGTCCGGC GCCCCCTCGG CGCTCCCCGC AG 102 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1..563 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence" EXPERIMENTAL 

/standard name* "Alternate Exon 3: D4.2" 
/note- "Tn*is sequence represent the sequence of 
the third exon of allele 04 .2 of the human D4 
dopamine receptor gene" 

(ix) FEATURE: 

(A) NAME/KEY: miac_feature 

(B) LOCATION: 257.. 262 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /function- "Polymorphic PstI site" 

/evidence- EXPERIMENTAL 
/label- PstI 

/note= "This feature is the site of one of the 
restriction enzymes whereby digestion of genomic 
DNA produces a RFLP " 

(ix) FEATURE: 

(A) NAME /KEY: repeat_region 

(B) LOCATION: 346.. 442 

(D) OTHER INFORMATION: /rpt type= "tandem" 
/rpt_unit» 348 39? 

/note* "This sequence represents ne of 7 known 
alleles of human D4 dopamine receptor gene 
encoding a 16 amino acid sequence repeated twice 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2 ..563 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

G TTC GTG GCC GTG GCC GTG COG CTG CCC TAC AAC CGG CAG GGT GGG 46 
Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly 
1 5 10 15 

AGC CGC CGG CAG CTG CTG CTC ATC GGC GCC ACG TGG CTG CTG TCC GCG 94 
Ser Arg Arg Gin Leu Leu Leu lie Gly Ala Thr Trp Leu Leu Ser Ala 
20 25 30 

GCG GTG GCG GCG CCC GTA CTG TGC GGC CTC AAC GAC GTG CGC GGC CGC 142 
Ala Val Ala Ala Pro Val Leu Cys Gly Leu Asn Asp Val Arg Gly Arg 
35 40 45 

GAC CCC GCC GTG TGC CGC CTG GAG GAC CGC GAC TAC GTG GTC TAC TCG 190 
Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser 
50 55 60 

TCC GTG TGC TCC TTC TTC CTA CCC TGC CCG CTC ATG CTG CTG CTG TAC 238 
Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met Leu Leu Leu Tyr 
65 70 75 

TGG GCC ACG TTC CGC GGC CTG CAG CGC TGG GAG GTG GCA CGT CGC GCC 286 
Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val Ala Arg Arg Ala 
80 85 90 95 

AAG CTG CAC GGC CGC GCG CCC CGC CGA CCC AGC GGC CCT GGC CCG CCT 334 
Lys Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly Pro Gly Pro Pro 
100 105 110 

TCC CCC ACG CCA CCC GCG CCC CGC CTC CCC CAG GAC CCC TGC GGC CCC 382 
Ser Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp Pro Cys Gly Pro 
115 120 125 

GAC TGT GCG CCC CCC GCG CCC GGC CTC CCC CCG GAC CCC TGC GGC TCC 430 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Ser 
130 . 135 140 

AAC TGT GCT CCC CCC GAC GCC GTC AGA GCC GCC GCG CTC CCA CCC CAG 478 
Asn Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala Leu Pro Pro Gin 
145 150 155 

ACT CCA CCG CAG ACC CGC AGG AGG CGG CGT GCC AAG ATC ACC GGC CGG 526 
Thr Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys He Thr Gly Arg 
160 165 170 175 

GAG CGC AAG GCC ATG AGG GTC CTG CCG GTG GTG GTC G 563 
Glu Arg Lys Ala Met Arg Val Leu Pro Val Val Val 
180 185 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 acino acidB 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly Ser 
1 5 10 15 

Ara Arg Gin Leu Leu Leu lie Gly Ala Thr Trp Leu Leu Ser Ala Ala 
20 25 30 

Val Ala Ala Pro Val Leu Cye Gly Leu Asn Aep Val Arg Gly Arg Aep 
35 40 45 

Pro Ala Val Cys Arg Leu Glu Asp Arg Aep Tyr Val Val Tyr Ser Ser 
50 55 60 

Val Cys Ser Phe Phe Leu Pro Cye Pro Leu Met Leu Leu Leu Tyr Trp 
65 70 75 80 

Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val Ala Arg Arg Ala Lye 
85 90 95 

Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly Pro Gly Pro Pro Ser 

100- 105 110 

Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp Pro Cys Gly Pro Asp 
115 120 125 

Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Ser Asn 
130 135 140 

Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala Leu Pro Pro Gin Thr 
145 150 155 160 

Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys lie Thr Gly Arg Glu 
165 170 175 

Arg Lys Ala Met Arg Val Leu Pro Val Val Val 
180 185 

(2) INFORMATION FOR SEQ ID NO: 10 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

( ix) FEATURE: 

(A) NAME /KEY: axon 

(B) LOCATION: 1..659 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidence" EXPERIMENTAL 

/standard name= "Alternate Exon 3: D4.4" 

/notes -This sequence represents the third exon of 

allele D4.4 of the human D4 dopamine receptor 

gene" 

(ix) FEATURE: 

(A) NAME /KEY: misc feature 

(B) LOCATION: 257.7262 

(C) IDENTIFICATION METHOD i experimental 

(D) OTHER INFORMATION: /function- "PstI site" 

/evidence* EXPERIMENTAL 
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/standard names "PstI site" 
/label- PstI 

/note- "This sequence represents a polymorphic 
PstI site whereby digestion of human genomic DNA 
produces a RFLP M 

(ix) FEATURE: 

(A) NAME/KEY: repeat region 

(B) LOCATION: 346.. 538 

(C) IDENTIFICATION METHOD t experimental 

(D) OTHER INFORMATION : /rpt_type= "tandem" 

/evidence- EXPERIMENTAL 
/rpt unit- 348 .. 396 

/note- "This repeat is present in 7 known alleles 
of the human D4 dopamine receptor gene and encodes 
a 16 amino acid sequence repeated 4 times in the 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 2 ..659 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

G TTC GTG GCC GTG GCC GTG CCG CTG CGC TAC AAC CGG CAG GGT GGG 46 
Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly 
1 5 10 15 

AGO CGC CGG CAG CTG CTG CTC ATC GGC GCC ACG TGG CTG CTG TCC GCG 94 
Ser Arg Arg Gin Leu Leu Leu He Gly Ala Thr Trp Leu Leu Ser Ala 
20 25 30 

GCG GTG GCG GCG CCC GTA CTG TGC GGC CTC AAC GAC GTG CGC GGC CGC 142 
Ala Val Ala Ala Pro Val Leu Cys Gly Leu Asn Asp Val Arg Gly Arg 
35 40 45 

GAC CCC GCC GTG TGC CGC CTG GAG GAC CGC GAC TAC GTG GTC TAC TCG 190 
Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser 
50 55 60 

TCC GTG TGC TCC TTC TTC CTA CCC TGC CCG CTC ATG CTG CTG CTG TAC 238 
Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met Leu Leu Leu Tyr 
65 70 75 

TGG GCC ACG TTC CGC GGC CTG CAG CGC TGG GAG GTG GCA CGT CGC GCC 286 
Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val Ala Arg Arg Ala 
80 85 90 95 

AAG CTG CAC GGC CGC GCG CCC CGC CGA CCC AGC GGC CCT GGC CCG CCT 334 
Lys Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly Pro Gly Pro Pro 
100 105 HO 

TCC CCC ACG CCA CCC GCG CCC CGC CTC CCC CAG GAC CCC TGC GGC CCC 382 
Ser Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp Pro Cys Gly Pro 
115 120 125 

GAC TGT GCG CCC CCC GCG CCC GGC CTT CCC CGG GGT CCC TGC GGC CCC 430 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly Pro Cys Gly Pro 
130 135 140 

GAC TGT GCG CCC GCC GCG CCC AGC CTC CCC CAG GAC CCC TGC GCC CCC 478 
Asp Cys Ala Pro Ala Ala Pro Ser Leu Pro Gin Asp Pro Cys Gly Pro 
145 150 155 
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GAC 
ABp 
160 


TGT 
Cys 


GCG 
Ala 


CCC CCC. GCG 
Pro Pro Ala 
165 


CCC 
Pro 


GGC 
Gly 


CTC 
Leu 


CCC 
Pro 


CCG 
Pro 
170 


GAC 
Asp 


CCC 
Pro 


TGC 
Cys 


GGC 
Gly 


TCC 
Ser 

175 


526 


AAC 
Abxi 


TGT 
Cys 


GCT 
Ala 


CCC CCC GAC 
Pro Pro Aep 
180 


GCC 
Ala 


GTC 
Val 


AGA 
Arg 


GCC 
Ala 
185 


GCC 
Ala 


GCG 
Ala 


CTC 
Leu 


CCA 
Pro 


CCC 
Pro 
190 


CAG 
Gin 


574 


ACT 
Thr 


CCA 
Pro 


CCG 

Pro 


CAG ACC CGC 
Gin Thr Arg 
195 


AGG 
Arg 


AGG 
Arg 


CGG 
Arg 
200 


CGT 
Arg 


GCC 
Ala 


AAG 
Lys 


ATC 
lie 


ACC 
Thr 
205 


GGC 
Gly 


CGG 

Arg 


622 


GAG 
GlU 


CGC 
Arg 


AAG 
Lys 

210 


GCC ATG AGG 
Ala Met Arg 


GTC 
Val 


CTG 
Leu 
215 


CCG 
Pro 


GTG 
Val 


GTG 
Val 


GTC 
Val 


G 








659 



(2) INFORMATION FOR SEQ ID NO tilt 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH t 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly Ser 
1 5 10 15 

Arg Arg Gin Leu Leu Leu lie Gly Ala Thr Trp Leu Leu Ser Ala Ala 
20 25 30 

Val Ala Ala Pro Val Leu Cys Gly Leu Asn Asp Val Arg Gly Arg Asp 
35 40 45 

Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser Ser 
50 55 60 

Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met Leu Leu Leu Tyr Trp 
65 70 75 80 

Ala Thr Phe Arg Gly Leu Gin . Arg Trp Glu Val Ala Arg Arg Ala Lys 
85 90 95 

Leu His Gly Arg Ala Pro Arg Arg Fro Ser Gly Pro Gly Pro Pro Ser 
100 105 110 

Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp Pro Cys Gly Pro Asp 
115 120 125 

Cys Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly Pro Cys Gly Pro Asp 
130 135 140 

Cys Ala Pro Ala Ala Pro Ser Leu Pro Gin Asp Pro Cys Gly Pro Asp 
145 150 155 160 

Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Ser Asn 
165 170 175 

Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala Leu Pro Pro Gin Thr 
180 185 190 

Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys He Thr Gly Arg Glu 

195 200 . 205 
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Arg Lya Ala Met Arg Val Leu Pro Val Val Val 
210 215 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONSSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 1..803 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /evidences EXPERIMENTAL 

/standard name- "Alternate Exon 3: D4.7" 

/note° "TKis sequence represents the third exon of 

allele D4.7 of the human D4 dopamine receptor 

gene" 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 257.7262 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /functions "PstI site" 

/evidence- EXPERIMENTAL 
/standard names "PstI site" 
/label- PstI 

/note- "This sequence is a PstI site whereby 
digestion of human genomic DNA produces a RFLP" 

( ix ) FEATURE : 

(A) NAME/KEY: repeat_region 

(B) LOCATION: 346.. 682 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /rpt_type» "tandem" 

/evidences EXPERIMENTAL 
/rpt unit- 346 ..394 

/note- "This sequence is a repeat found in 7 known 
alleles of the human D4 dopamine receptor gene 
encoding a 16 amino acid sequence repeated 7 times 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

G TTC GTG GCC GTG GCC GTG CCG CTG CGC TAC AAC CGG CAG GGT GGG 46 
Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly 
1 5 . 10 15 

AGC CGC CGG CAG CTG CTG CTC ATC GGC GCC ACG TGG CTG CTG TCC GCG 94 
Ser Arg Arg Gin Leu Leu Leu He Gly Ala Thr Trp Leu Leu Ser Ala 
20 25 30 

GCG GTG GCG GCG CCC GTA CTG TGC GGC CTC AAC GAC GTG CGC GGC CGC 142 
Ala Val Ala Ala Pro Val Leu Cys Gly Leu. Asn Asp Val Arg Gly Arg 
35 40 45 
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GAC CCC GCC CTG TGC CGC CTG GAG GAC CGC GAC TAC GTG GTC TAC TCG 190 
Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser 
50 55 60 

TCC GTG TGC TCC TTC TTC CTA CCC TGC CCG CTC ATG CTG CTG CTG TAC 238 
Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met Leu Leu Leu Tyr 
65 70 75 

TGG GCC ACG TTC CGC GGC CTG CAG CGC TGG GAG GTG GCA CGT CGC GCC 286 
Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val Ala Arg Arg Ala 
80 85 90 95 

AAG CTG GAC GGC CGC GCO CCC CGC CGA CCC AGC GGC CCT GGC CCG CCT 334 
Lys Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly Pro Gly Pro Pro 
100 105 110 

TCC CCC ACG CCA CCC GCO CCC CGC CTC CCC CAG GAC CCC TGC GGC CCC 382 
Ser Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp Pro Cys Gly Pro 
115 120 125 

GAC TGT GCG CCC CCC GCG CCC GGC CTT CCC CGG GGT CCC TGC GGC CCC 430 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly Pro Cys Gly Pro 
130 135 140 

GAC TGT GCG CCC GCC GCG CCC GGC CTC CCC CCG GAC CCC TGC GGC CCC 478 
Asp Cys Ala Pro Ala Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Pro 
145 150 155 

GAC TGT GCG CCC CCC GCG CCC GGC CTC CCC CAG GAC CCC TGC GGC CCC 526 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Gin Asp Pro Cys Gly Pro 
160 165 170 175 

GAC TGT GCG CCC CCC GCG CCC GGC CTT CCC CGG GGT CCC TGC GGC CCC 574 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly Pro Cys Gly Pro 
180 185 190 

GAC TGT GCG CCC CCC GCG CCC GGC CTC CCC CAG GAC CCC TGC GGC CCC 622 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Gin Asp Pro Cys Gly Pro 
195 200 205 

GAC TGT GCG CCC CCC GCG CCC GGC CTC CCC CCG GAC CCC TGC GGC TCC 670 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Ser 
210 215 220 

AAC TGT GCT CCC CCC GAC GCC GTC AGA GCC GCC GCG CTC CCA CCC CAG 718 
Asn Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala Leu Pro Pro Gin 
225 230 235 

ACT CCA CCG CAG ACC CGC AGG AGG CGG CGT GCC AAG ATC ACC GGC CGG 766 
Thr Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys lie Thr Gly Arg 
240 245 250 255 

GAG CGC AAG GCC ATG AGG GTC CTG CCG GTG GTG GTC G 803 
Glu Arg Lys Ala Met Arg Val Leu Pro Val Val Val 
260 265 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: prot in 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly Ser 
1 5 10 15 

Arg Arg Gin Leu Leu Leu lie Gly Ala Thr Trp Leu Leu Ser Ala Ala 
20 25 30 

Val Ala Ala Pro Val Leu Cys Gly Leu Aen Asp Val Arg Gly Arg Asp 
35 40 45 

Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser Ser 
50 55 60 

Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met Leu Leu Leu Tyr Trp 
65 70 75 80 

Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val Ala Arg Arg Ala Lys 
85 90 95 

Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly Pro Gly Pro Pro Ser 
100 105 110 

Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp Pro Cys Gly Pro Asp 
115 120 125 

Cys Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly Pro Cys Gly Pro Asp 
130 135 140 

Cys Ala Pro Ala Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Pro Asp 
145 150 155 160 

Cya Ala Pro Pro Ala Pro Gly Leu Pro Gin Asp Pro Cys Gly Pro Asp 
165 170 175 

Cys Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly Pro Cys Gly Pro Asp 
180 185 190 

CyB Ala Pro Pro Ala Pro Gly Leu Pro Gin Asp Pro Cys Gly Pro Asp 
195 200 . 205 

Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Ser Asn 
210 215 220 

Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala Leu Pro Pro Gin Thr 
225 230 235 240 

Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys lie Thr Gly Arg Glu 
245 250 255 

Arg Lys Ala Met Arg Val Leu Pro Val Val Val 
260 265 

(2) INFORMATION FOR SEQ ID NO:14t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (gen mic) 



(ix) FEATURE: 
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(A) NAME/KEY: intron 

(B) LOCATI N: 1..94 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GTGGGTTCCT GTCCTGAGGG GCGGGGAGGA GAGGAGGGGG GGAGTACGAG GCCGGCTGGG 60 
CGGGGGGCGC TAACGCGGCT CTCGGCGCCC CCAG 94 

(2) INFORMATION FOR SEQ ID NO:15: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: axon 

(B) LOCATION: 1..328 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 3.. 203 

( ix ) FEATURE : 

(A) NAME /KEY: 3'UTR 

(B) LOCATION: 204. .328 

( ix ) FEATURE J 

(A) NAME/KEY: polyA site 

(B) LOCATION: 304 ~ 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION: 36. * 41 

(C) IDENTIFICATION METHOD: experimental 

(D) OTHER INFORMATION: /function* "Hindi site" 

/evidence= EXPERIMENTAL 
/standard names "HinCII site" 
/label" HlnCII 

/note" "This sequence is a HinCII site whereby 
digestion of genomic DNA produces a RFLP" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GG GCC TTC CTG CTG TGC TGG ACG CCC TTC TTC GTG GTG CAC ATC ACG 47 
Ala Phe Leu Leu Cys Trp Thr Pro Phe Phe Val Val His He Thr 
1 5 10 15 

CAG GCG CTG TGT CCT GCC TGC TCC GTG CCC CCG CGG CTG GTC AGC GCC 95 
Gin Ala Leu Cys Pro Ala Cys Ser Val Pro Pro Arg Leu Val Ser Ala 
20 25 30 

GTC ACC TGG CTG GGC TAC GTC AAC AGC GCC CTC ACC CCC GTC ATC TAC 143 
Val Thr Trp Leu Gly Tyr Val Asn Ser Ala Leu Thr Pro Val He Tyr 
35 40 45 
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ACT TC TTC AAC CCC GAG TTC CGC AAC GTC TTC CGC AAG GCC CTG COT 191 
Thr Val Phe Asn Ala Glu Phe Arg Asn Val Phe Arg Lys Ala Lou Arg 
50 55 60 

GCC TGC TGC TGAGCCGGGC ACCCCCGGAC GCCCCCCGGC CTGATGGCCA 240 
Ala Cya Cys 
65 

GOCCTCAGGG ACCAAGGAGA TGGGGAGGGC GCTTTTGTAC GTTAATTAAA CAAATTCCTT 300 
CCCAAACTCA GCTGTGAAGG CTCCTGGG 328 

(2) INFORMATION FOR SSQ ID NO t 16 t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 66 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Ala Phe Leu Leu Cya Trp Thr Pro Phe Phe Val Val His lie Thr Gin 
1 5 10 15 

Ala Leu Cya Pro Ala Cya Ser Val Pro Pro Arg Leu Val Ser Ala Val 
20 25 30 

Thr Trp Leu Gly Tyr Val Asn Ser Ala Leu Thr Pro Val lie Tyr Thr 
35 40 45 

Val Phe Asn Ala Glu Phe Arg Asn Val Phe Arg Lys Ala Leu Arg Ala 
50 55 60 

Cys Cys 
65 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1370 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1..103 

(ix) FEATURE: 

(A) NAME /KEY: 3'UTR 

(B) LOCATION* 1268.. 1370 

(ix) FEATURE: 

(A) NAME /KEY t CDS 

(B) LOCATION: 104.. 1267 
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(xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 17: 

CGGGGGCGGG ACCAGGGTCC GGCCGGGGCG TGCCCCCGGG GAGGGACTCC CCGGCTTGCC 60 

CCCCGGCGTT GTCCGCGGTG CTCAGCGCCC GCCCGGGCGC GCC ATG GGG AAC CGC 115 

Met Gly Asn Arg 
1 

AGC ACC GCO GAC GCG GAC GGG CTG CTG GCT GGG CGC GGG CGG GCC GCG 163 
Ser Thr Ala Asp Ala Asp Gly Leu Leu Ala Gly Arg Gly Arg Ala Ala 
5 10 15 20 

GGG GCA TCT GCG GGG GCA TCT GCG GGG CTG GCT GGG GAG GGC GCG GCG 211 
Gly Ala Ser Ala Gly Ala Ser Ala Gly Leu Ala Gly Gin Gly Ala Ala 
25 30 35 

GCG CTG GTG GGG GGC GTG CTG CTC ATC GGC GCG GTG CTC GCG GGG AAC 259 
Ala Leu Val Gly Gly Val Leu Leu lie Gly Ala Val Leu Ala Gly Asn 
40 45 50 



TCG CTC GTG TGC GTG AGC GTG GCC ACC GAG CGC GCC CTG CAG ACG CCC 307 
Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala Leu Gin Thr Pro 
55 60 65 

ACC AAC TCC TTC ATC GTG AGC CTG GCG GCC GCC GAC CTC CTC CTC GCT 355 
Thr Asn Ser Phe lie Val Ser Leu Ala Ala Ala Asp Leu Leu Leu Ala 
70 75 80 

CTC CTG GTG CTG CCG CTC TTC GTC TAC TCC GAG GTC CAG GGT GGC GCG 403 
Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val Gin Gly Gly Ala 
85 90 95 100 

TGG CTG CTG AGC CCC CGC CTG TGC GAC GCC CTC ATG GCC ATG GAC GTC 451 
Trp Leu Leu Ser Pro Arg Leu Cys Asp Ala Leu Met Ala Met Asp Val 
105 110 115 

ATG CTG TGC ACC GCC TCC ATC TTC AAC CTG TGC GCC ATC AGC GTG GAC 499 
Met Leu Cys Thr Ala Ser He Phe Asn Leu Cys Ala He Ser Val Asp 
120 125 .130 

AGG TTC GTG GCC GTG GCC GTG CCG CTG CGC TAC AAC CGG CAG GGT GGG 547 
Arg Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly 
135 140 145 

AGC CGC CGG CAG CTG CTG CTC ATC GGC GCC ACG TGG CTG CTG TCC GCG 595 
Ser Arg Arg Gin Leu Leu Leu He Gly Ala Thr Trp Leu Leu Ser Ala 
150 155 160 

GCG GTG GCG GCG CCC GTA CTG TGC GGC CTC AAC GAC GTG CGC GGC CGC 643 
Ala Val Ala Ala Pro Val Leu Cys Gly Leu Asn Asp Val Arg Gly Arg 
165 170 175 180 

GAC CCC GCC GTG TGC CGC CTG GAG GAC CGC GAC TAC GTG GTC TAC TCG 691 
Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser 
185 190 195 

TCC GTG TGC TCC TTC TTC CTA CCC TGC CCG CTC ATG CTG CTG CTG TAC 739 
Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met Leu Leu Leu Tyr 
200 205 210 

TGG GCC ACG TTC CGC GGC CTG CAG CGC TGG GAG GTG GCA CGT CGC GCC 787 
Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val Ala Arg Arg Ala 
215 220 225 
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AAG CTG CAC GGC CGC GCG CCC CGC CGA CCC AGC GGC CCT GGC CCG CCT . 835 
Lys Leu Hie Gly Arg Ala Pro Arg Arg Pro Ser Gly Pro Gly Pro Pro 
230 235 240 

TCC CCC ACG CCA CCC GCG CCC CGC CTC CCC CAG GAC CCC TGC GGC CCC 883 
Ser Pro Xhr Pro Pro Ala Pro Arg Leu Pro Gin Asp Pro Cys Gly Pro 
245 250 255 260. 

GAC TGT GCG CCC CCC GCG CCC GGC CTC CCC CCG GAC CCC TGC GGC TCC 931 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Ser 
265 270 275 

AAC TGT GCT CCC CCC GAC GCC GTC AGA GCC GCC GCG CTC CCA CCC CAG 979 
Asn Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala Leu Pro Pro Gin 
280 285 290 

ACT CCA CCG CAG ACC CGC AGG AGG CGG CGT GCC AAG ATC ACC GGC CGG 1027 
Thr Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys lie Thr Gly Arg 
295 300 305 

GAG CGC AAG GCC ATG AGG GTC CTG CCG GTG GTG GTC GGG GCC TTC CTG 1075 
Glu Arg Lys Ala Met Arg Val Leu Pro Val Val Val Gly Ala Phe Leu 
310 315 320 

CTG TGC TGG ACQ CCC TTC TTC GTG GTG CAC ATC ACG CAG GCG CTG TGT 1123 
Leu Cys Trp Thr Pro Phe Phe Val Val His He Thr Gin Ala Leu Cys 
325 330 335 340 

CCT GCC TGC TCC GTG CCC CCG CGG CTG GTC AGC GCC GTC ACC TGG CTG 1171 
Pro Ala Cys Ser Val Pro Pro Arg Leu Val Ser Ala Val Thr Trp Leu 
345 350 355 

GGC TAC GTC AAC AGC GCC CTC ACC CCC GTC ATC TAG ACT GTC TTC AAC 1219 
Gly Tyr Val Asn Ser Ala Leu Thr Pro Val He Tyr Thr Val Phe Asn 
360 365 370 

GCC GAG TTC CGC AAC GTC TTC CGC AAG GCC CTG CGT GCC TGC TGC TGAGCCGGGC 1274 
Ala Glu Phe Arg Asn Val Phe Arg Lys Ala Leu Arg Ala Cys Cys 
375 380 385 

ACCCCCGGAC GCCCCCCGGC CTGATGGCCA GGCCTCAGGG ACCAAGGAGA TGGGGAGGGC 1334 
GCTTTTGTAC GTTAATTAAA CAAATTCCTT CCCAAA 1370 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Gly Asn Arg Ser Thr Ala Asp Ala Asp Gly Leu Leu Ala Gly Arg 
I 5 10 15 

Gly Arg Ala Ala Gly Ala Ser Ala Gly Ala Ser Ala Gly Leu Ala Gly 
20 25 30 

Gin Gly Ala Ala Ala Leu Val Gly Gly Val Leu Leu He Gly Ala Val 
35 40 45 
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Leu Ala Gly Asn Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala 
50 55 60 

Leu Gin Thr Pro Thr Asn Ser Phe lie Val Ser Leu Ala Ala Ala Asp 
65 70 75 80 

Leu Leu Leu Ala Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val 
85 90 95 

Gin Gly Gly Ala Trp Leu Leu Ser Pro Arg Leu Cys Asp Ala Leu Met 
100 105 110 

Ala Ket Asp Val Met Leu Cys Thr Ala Ser lie Phe Asn Leu Cys Ala 
115 120 125 

He Ser Val Asp Arg Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn 
130 135 140 

Arg Gin Gly Gly Ser Arg Arg Gin Leu Leu Leu lie Gly Ala Thr Trp 
145 150 155 160 

Leu Leu Ser Ala Ala Val Ala Ala Pro Val Leu Cys Gly Leu Asn Asp 
165 170 175 

Val Arg Gly Arg Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr 
180 185 190 

Val Val Tyr Ser Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Het 
195 200 205 

Leu Leu Leu Tyr Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val 
210 215 220 

Ala Arg Arg Ala Lye Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly 
225 230 235 240 

Pro Gly Pro Pro Ser Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp 
245 250 255 

Pro Cys Gly Pro Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp 
260 265 270 

Pro Cys Gly Ser Asn Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala 
275 280 285 

Leu Pro Pro Gin Thr Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys 
290 295 300 

He Thr Gly Arg Glu Arg Lys Ala Met Arg Val Leu Pro Val Val Val 
305 310 315 320 

Gly Ala Phe Leu Leu Cys Trp Thr Pro Phe Phe Val Val His He Thr 
325 330 335 

Gin Ala Leu Cys Pro Ala Cys Ser Val Pro Pro Arg Leu Val Ser Ala 
340 345 350 

Val Thr Trp Leu Gly Tyr Val Asn Ser Ala Leu Thr Pro Val He Tyr 
355 360 365 

Thr Val Phe Asn Ala Glu Phe Arg Asn Val Phe Arg Lys Ala Leu Arg 
370 375 380 

Ala Cys Cys 
385 
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(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS t 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1**103 

(ix) FEATURE : 

(A) NAME /KEY: 3'UTR 

(B) LOCATION: 1364. .1466 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 104.. 1363 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGGGGGCGGG ACCAGGGTCC GGCCGGGGCG TGCCCCCGGG GAGGGACTCC CCGGCTTGCC 60 

CCCCGGCGTT GTCCGCGGTG CTCAGCGCCC GCCCGGGCGC GCC ATG GGG AAC CGC 115 

Met Gly Asn Arg 
1 

AGC ACC GCG GAC GCG GAC GGG CTG CTG GCT GGG CGC GGG CGG GCC GCG 163 
Ser Thr Ala Asp Ala Asp Gly Leu Leu Ala Gly Arg Gly Arg Ala Ala 
5 10 15 20 

GGG GCA TCT GCG GGG GCA TCT GGG GGG CTG GCT GGG CAG GGC GCG GCG 211 
Gly Ala Ser Ala Gly Ala Ser Ala Gly Leu Ala Gly Gin Gly Ala Ala 
25 30 35 

GCG CTG GTG GGG GGC GTG CTG CTC ATC GGC GCG GTG CTC GCG GGG AAC 259 
Ala Leu Val Gly Gly Val Leu Leu lie Gly Ala Val Leu Ala Gly Asn 
40 45 50 

TCG CTC GTG TGC GTG AGC GTG GCC ACC GAG CGC GCC CTG CAG ACG CCC 307 
Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala Leu Gin Thr Pro 
55 60 65 

ACC AAC TCC TTC ATC GTG AGC CTG GCG GCC GCC GAC CTC CTC CTC GCT 355 
Thr Asn Ser Phe lie Val Ser Leu Ala Ala Ala Asp Leu Leu Leu Ala 
70 75 80 

CTC CTG GTG CTG COG CTC TTC GTC TAC TCC GAG GTC CAG GGT GGC GCG 403 
Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val Gin Gly Gly Ala 
85 90 95 100 

TGG CTG CTG AGC CCC CGC CTG TGC GAC GCC CTC ATG GCC ATG GAC GTC 451 
Trp Leu Leu Ser Pro Arg Leu Cys Asp Ala Leu Met Ala Met Asp Val 
105 110 115 

ATG CTG TGC ACC GCC TCC ATC TTC AAC CTG TGC GCC ATC AGC GTG GAC 499 
Met Leu Cys Thr Ala Ser lie Phe Asn Leu Cys Ala lie Ser Val Asp 
120 125 130 
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AGG TTC GTG GCC GTG GCC GTG CCG CTG CGC TAC AAC CGG CAG GG7 GGG 547 
Arg Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly 
135 140 145 

AGC CGC CGG CAG CTG CTG CTC ATC GGC GCC ACG TGG CTG CTG TCC GCG 595 
Ser Arg Arg Gin Leu Leu Leu lie Gly Ala Thr Trp Leu Leu Ser Ala 
150 155 160 

GCG GTG GCG GCG CCC GTA CTG TGC GGC CTC AAC GAC GTG CGC GGC CGC 643 
Ala Val Ala Ala Pro Val Leu Cys Gly Leu Aen Asp Val Arg Gly Arg 
165 170 175 180 

GAC CCC GCC GTG TGC CGC CTG GAG GAC CGC GAC TAC GTG GTC TAC TCG 691 
Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser 
185 190 195 

TCC GTG TGC TCC TTC TTC CTA CCC TGC CCG CTC ATG CTG CTG CTG TAC 739 
Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met Leu Leu Leu Tyr 
200 205 210 

TGG GCC ACG TTC CGC GGC CTG CAG CGC TGG GAG GTG GCA CGT CGC GCC 787 
Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val Ala Arg Arg Ala 
215 220 225 

AAG CTG CAC GGC CGC GCG CCC CGC CGA CCC AGC GGC CCT GGC CCG CCT 835 
Lys Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly Pro Gly Pro Pro 
230 235 240 

TCC CCC ACG CCA CCC GCG CCC 'CGC CTC CCC CAG GAC CCC TGC GGC CCC 883 
Ser Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp Pro Cys Gly Pro 
245 250 255 260 

GAC TGT GCG CCC CCC GCG CCC GGC CTT CCC CGG GGT CCC TGC GGC CCC 931 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly Pro Cys Gly Pro 
265 270 .275 

GAC TGT GCG CCC GCC GCG CCC AGC CTC CCC CAG GAC CCC TGC GGC CCC 979 
Asp Cys Ala Pro Ala Ala Pro Ser Leu Pro Gin Asp Pro Cys Gly Pro 
280 285 290 

GAC TGT GCG CCC CCC GCG CCC GGC CTC CCC CCG GAC CCC TGC GGC TCC 1027 
Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp Pro Cys Gly Ser 
295 300 305 

AAC TGT GCT CCC CCC GAC GCC GTC AGA GCC GCC GCG CTC CCA CCC CAG 1075 
Asn Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala Leu Pro Pro Gin 
310 315 320 

ACT CCA CCG CAG ACC CGC AGG AGG CGG CGT GCC AAG ATC ACC GGC CGG 1123 
Thr Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys lie Thr Gly Arg 
325 330 335. 340 

GAG CGC AAG GCC ATG AGG GTC CTG CCG GTG GTG GTC GGG GCC TTC CTG 1171 
Glu Arg Lys Ala Met Arg Val Leu Pro Val Val Val Gly Ala Phe Leu 
345 350 355 

CTG TGC TGG ACG CCC TTC TTC GTG GTG CAC ATC ACG CAG GCG CTG TGT 1219 
Leu Cys Trp Thr Pro Phe Phe Val Val His lie Thr Gin Ala Leu Cys 
360 365 370 

CCT GCC TGC TCC GTG CCC CCG CGG CTG GTC AGC GCC GTC ACC TGG CTG 1267 
Pro Ala Cys Ser Val Pro Pro Arg Leu Val Ser Ala Val Thr Trp Leu 
375 380 385 
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CGC TAC GTC AAC AGC CCC CTC ACC CCC GTC ATC TAC ACT GTC TTC AAC 1315 

Gly Tyr Val Asn Ser Ala Leu Thr Pro Val He Tyr Thr Val Phe Aan 

390 395 400 

GCC GAG TTC CGC AAC GTC TTC CGC AAG GCC CTG CGT GCC TGC TGC TGAGCCGGGC 1370 

Ala Glu Phe Arg Aen Val Phe Arg Lys Ala Leu Arg Ala Cys Cys 
405 410 415 420 

ACCCCCGGAC GCCCCCCGGC CTGATGGCCA GGCCTCAGGG ACCAAGGAGA TGGGGAGGGC 1430 

GCTTTTGTAC GTTAATTAAA CAAATTCCTT CCCAAA 1466 



(2) INFORMATION FOR SEQ ID NO: 20 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Gly Asn Arg Ser Thr Ala Asp Ala Asp Gly Leu Leu Ala Gly Arg 
1 5 10 15 

Gly Arg Ala Ala Gly Ala Ser Ala Gly Ala Ser Ala Gly Leu Ala Gly 
20 25 30 

Gin Gly Ala Ala Ala Leu Val Gly Gly Val Leu Leu He Gly Ala Val 
35 . 40 45 

Leu Ala Gly Asn Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala 
50 55 60 

Leu Gin Thr Pro Thr Asn Ser Phe He Val Ser Leu Ala Ala Ala Asp 
65 70 75 80 

Leu Leu Leu Ala Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val 
85 90 95 

Gin Gly Gly Ala Trp Leu Leu Ser Pro Arg Leu Cys Asp Ala Leu Met 
100 105 110 

Ala Met Asp Val Met Leu Cys Thr Ala Ser He Phe Asn Leu Cys Ala 
115 120 125 

He Ser Val Asp Arg Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn 
130 135 140 

Arg Gin Gly Gly Ser Arg Arg Gin Leu Leu Leu He Gly Ala Thr Trp 
145 150 155 160 

Leu Leu Ser Ala Ala Val Ala Ala Pro Val Leu Cys Gly Leu Asn Asp 
165 170 175 

Val Arg Gly Arg Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp' Tyr 
180 185 190 

Val Val Tyr Ser Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met 
195 200 205 

Leu Leu Leu Tyr Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val 
210 215 .220 
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Ala Arg Arg Ala- Lys Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly 
225 230 235 240 

Pro Gly Pr Pro Ser Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp 
245 250 255 

Pro Cya Gly Pro Asp Cye Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly 
260 265 270 

Pro Cys Gly Pro Asp Cys Ala Pro Ala Ala Pro Ser Leu Pro Gin Asp 
275 280 285 

Pro Cys Gly Pro Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp 
290 295 300 

Pro Cye Gly Ser Asn Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala 
305 310 315 320 

Leu Pro Pro Gin Thr Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys 
325 330 335 

lie Thr Gly Arg Glu Arg Lys Ala Met Arg Val Leu Pro Val Val Val 
340 345 350 

Gly Ala Phe Leu Leu Cys Trp Thr Pro Phe Phe Val Val His He Thr 
355 360 365 

Gin Ala Leu Cys Pro Ala Cys Ser Val Pro Pro Arg Leu Val Ser Ala 
370 375 380 

Val Thr Trp Leu Gly Tyr Val Asn Ser Ala Leu Thr Pro Val He Tyr 
385 390 395 400 

Thr Val Phe ABn Ala Glu Phe Arg Asn Val Phe Arg Lys Ala Leu Arg 
405 410 415 

Ala Cys Cys 



(2) INFORMATION FOR SEQ ID NOi21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1610 base pairs 
.■(B) TYPE: nucleic acid 

(C) STRANDE0NESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME /KEY: 5'UTR 

(B) LOCATION: 1..103 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1508.. 1610 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 104.. 1507 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGGGGGCGGG ACCA6GGTCC GGCCGGGGCG TGCCCCCGGG GAGGGACTCC CCGGCTTGCC 60 

CCCCGGCGTT GTCCGCGGTG CTCAGCGCCC GCCCGGGCGC GCC ATG GGG AAC CGC 115 

Met Gly Asn Arg 
1 

AGC ACC GCG GAC GCG GAC GGG CTG CTG GCT GGG CGC GGG CGG GCC GCG 163 
Ser Thr Ala Asp Ala Asp Gly Leu Leu Ala Gly Arg Gly Arg Ala Ala 
5 10 15 20 

GGG GCA TCT GCG GGG GCA TCT GCG GGG CTG GCT GGG CAG GGC GCG GCG 211 
Gly Ala Ser Ala Gly Ala Ser Ala Gly Leu Ala Gly Gin Gly Ala Ala 
25 30 35 

GCG CTG GTG GGG GGC OTO CTG CTC ATC GGC GCG GTG CTC GCG GGG AAC 259 
Ala Leu Val Gly Gly Val Leu Leu He Gly Ala Val Leu Ala Gly Asn 
40 45 50 

TCG CTC GTG TGC GTG AGC GTG GCC ACC GAG CGC GCC CTG CAG ACG CCC 307 
Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala Leu Gin Thr Pro 
55 60 65 

ACC AAC TCC TTC ATC GTG AGC CTG GCG GCC GCC GAC CTC CTC CTC GCT 355 
Thr Asn Ser Phe He Val Ser Leu Ala Ala Ala Asp Leu Leu Leu Ala 
70 75 80 

CTC CTG GTG CTG CCG CTC TTC GTC TAC TCC GAG GTC CAG GGT GGC GCG 403 
Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val Gin Gly Gly Ala 
85 90 95 100 

TGG CTG CTG AGC CCC CGC CTG TGC GAC GCC CTC ATG GCC ATG GAC GTC 451 
Trp Leu Leu Ser Pro Arg Leu Cys Asp Ala Leu Met Ala Met Asp Val 
105 110 115 

ATG CTG TGC ACC GCC TCC ATC TTC AAC CTG TGC GCC ATC AGC GTG GAC 499 
Met Leu Cys Thr Ala Ser He Phe Asn Leu Cys Ala He Ser Val Asp 
120 125 130 

AGG TTC GTG GCC GTG GCC GTG CCG CTG CGC TAC AAC CGG CAG GGT GGG 547 
Arg Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn Arg Gin Gly Gly 
135 140 145 

AGC CGC CGG CAG CTG CTG CTC ATC GGC GCC ACG TGG CTG CTG TCC GCG 595 
Ser Arg Arg Gin Leu Leu Leu He Gly Ala Thr Trp Leu Leu Ser Ala 
150 155 160 

GCG GTG GCG GCG CCC GTA CTG TGC GGC CTC AAC GAC GTG CGC GGC CGC 643 
Ala Val Ala Ala Pro Val Leu Cys Gly Leu Asn Asp Val Arg Gly Arg 
165 170 175 180 

GAC CCC GCC GTG TGC CGC CTG GAG GAC CGC GAC TAC GTG GTC TAC TCG 691 
Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr Val Val Tyr Ser 
185 190 195 

TCC GTG TGC TCC TTC TTC CTA CCC TGC CCG CTC ATG CTG CTG CTG TAC 739 
Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met Leu Leu Leu Tyr 
200 205 210 

TGG GCC ACG TTC CGC GGC CTG CAG CGC TGG GAG GTG GCA CGT CGC GCC 787 
Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val Ala Arg Arg Ala 
215 220 225 
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AAG 
Lye 


CTG 
Leu 
230 


GAC 
His 


GGC CGC GCG 
Gly Arg Ala 


CCC 
Pro 
235 


CGC 
Arg 


CGA 
Arg 


CCC 
Pro 


AGC GGC 
Ser Gly 
240 


CCT 
Pro 


GGC 
Gly 


CCG 
Pro 


CCT , 
Pro 


835 


TCC 
Ser 
245 


CCC 
Pro 


ACG 
Thr 


CCA 
Pro 


CCC 
Pro 


GCG 
Ala 


CCC 
Pro 


CGC 
Arg 


CTC 
Leu 


CCC 
Pro 


CAG GAC 
Gin Asp 
255 


CCC 
Pro 


TGC 
Cys 


GGC 
Gly 


CCC 
Pro 
260 


883 


GAC 
Asp 


TGT 
Cys 


GCG 
Ala 


CCC 

Pro 


CCC 
Pro 


GCG 
Ala 


CCC 
Pro 


GGC 
Gly 


CTT 
Leu 


CCC 
Pro 
270 


CGG GGT 
Arg Gly 


CCC 
Pro 


TGC 
Cys 


GGC 
Gly 
275 


CCC 
Pro 


931 


GAC 
Asp 


TGT 
Cys 


GCG 
Ala 


CCC 

Pro 


GCC 
Ala 


GCG 
Ala 


CCC 
Pro 


GGC 
Gly 


CTC 
Leu 
285 


CCC 
Pro 


CCG GAC 
Pro Asp 


CCC 
Pro 


TGC 
Cys 
290 


GGC 
Gly 


CCC 
Pro 


979 


GAC 
Asp 


TGT 
Cys 


GCG 
Ala 
295 


CCC 
Pro 


CCC 
Pro 


GCG 
Ala 


CCC 
Pro 


GGC 
Gly 
300 


CTC 
Leu 


CCC 
Pro 


CAG GAC 
Gin Asp 


CCC 
Pro 
305 


TGC 
Cys 


GGC 
Gly 


CCC 
Pro 


1027 


GAC 
Asp 


TGT 
Cys 
310 


GCG 
Ala 


CCC 
Pro 


CCC 
Pro 


GCG 
Ala 


CCC 

Pro 
315 


GGC 
Gly 


CTT 
Leu 


CCC 
Pro 


CGG GGT 
Arg Gly 
320 


CCC 
Pro 


TGC 
Cys 


GGC 
Gly 


CCC 

Pro 


1075 


GAC 
Asp 
325 


TGT 
Cys 


GCG 
Ala 


CCC 
Pro 


ccc 
Pro 


GCG 
Ala 
330 


ccc 
Pro 


GGC 
Gly 


CTC 
Leu 


CCC 
Pro 


CAG GAC 
Gin Asp 
335 


CCC 
Pro 


TGC 
Cys 


GGC 
Gly 


ccc 
Pro 
340 


1123 


GAC 
Asp 


TGT 
Cys 


GCG 
Ala 


CCC 
Pro 


ccc 
Pro 
345 


GCG 
Ala 


ccc 
Pro 


GGC 
Gly 


CTC 
Leu 


CCC 
Pro 
350 


CCG GAC 
Pro Asp 


CCC 

Pro 


TGC 
Cys 


GGC 
Gly 
355 


TCC 
Ser 


1171 


AAC 
Asn 


TGT 
Cys 


GCT 
Ala 


CCC CCC GAC 
Pro Pro Asp 
360 


GCC 
Ala 


GTC 
Val 


AGA 
Arg 
365 


GCC 
Ala 


GCC 
Ala 


GCG 
Ala 


CTC 
Leu 


CCA 
Pro 
370 


CCC 
Pro 


CAG 
Gin 


1219 


ACT 
Thr 


CCA 
Pro 


CCG 
Pro 
375 


CAG ACC CGC 
Gin Thr Arg 


AGG 
Arg 


AGG 
Arg 
380 


CGG 
Arg 


CGT 
Arg 


GCC AAG 
Ala Lys 


ATC 
He 
385 


ACC 
Thr 


GGC 
Gly 


CGG 
Arg 


1267 


GAG 
Glu 


CGC 
Arg 
390 


AAG 
Lys 


GCC ATG AGG 
Ala Met Arg 


GTC 
Val 
395 


CTG 
Leu 


CCG 
Pro 


GTG 
Val 


GTG 
Val 


GTC 
Val 
400 


GGG 
Gly 


GCC 
Ala 


TTC 
Phe 


CTG 
Leu 


1315 


CTG 
Leu 
405 


TGC 
Cys 


TGG 
Trp 


ACG 
Thr 


CCC 
Pro 


TTC 
Phe 
410 


TTC 
Phe 


GTG 
Val 


GTG 
Val 


CAC 
His 


ATC 
He 
415 


ACG 
Thr 


CAG 
Gin 


GCG 
Ala 


CTG 
Leu 


TGT 
Cys 
420 


1363 


CCT 
Pro 


GCC 
Ala 


TGC 
Cys 


TCC 
Ser 


GTG 
Val 
425 


CCC 
Pro 


CCG 
pro 


CGG 
Arg 


CTG 
Leu 


GTC 
vai 
430 


AGC 
Ser 


GCC 
Ala 


GTC 
vax 


ACC 
rnr 


TGG 
Trp 
435 


CTG 
Leu 


1411 


GGC 
Gly 


TAC 
Tyr 


GTC 
Val 


AAC 
Asn 
440 


AGC 
Ser 


GCC 
Ala 


CTC 
Leu 


ACC 
Thr 


CCC 
Pro 
445 


GTC 
Val 


ATC 
He 


TAC 
Tyr 


ACT 
Thr 


GTC 
Val 
450 


TTC 
Phe 


AAC 
Asn 


1459 


GCC 
Ala 


GAG 
Glu 


TTC 
Phe 
455 


CGC AAC 
Arg Asn 


GTC 
Val 


TTC 
Phe 


CGC 
Arg 
460 


AAG 
Lys 


GCC 
Ala 


CTG 
Leu 


CGT 
Arg 


GCC 
Ala 
465 


TGC 
Cys 


TGC 
Cys 


TGAGCCGGGC 1514 



ACCCCCGGAC GCCCCCCGGC CTGATGGCCA GGCCTCAGGG ACCAAGGAGA TGGGGAGGGC 1574 
GCTTTTGTAC GTTAATTAAA CAAATTCCTT CCCAAA 1610 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS i 

(A) LENGTH: 467 amino acids 

(B) TYPEs amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

Met Gly Asn Arg Ser Thr Ala Asp Ala Asp Gly Leu Leu Ala Gly Arg 
1 5 10 15 

Gly Arg Ala Ala Gly Ala Ser Ala Gly Ala Ser Ala Gly Leu Ala Gly 
20 25 30 

Gin Gly Ala Ala Ala Leu Val Gly Gly Val Leu Leu He Gly Ala Val 
35 40 45 

Leu Ala Gly Asn Ser Leu Val Cys Val Ser Val Ala Thr Glu Arg Ala 
50 55 60 

Leu Gin Thr Pro Thr Asn Ser Phe He Val Ser Leu Ala Ala Ala Asp 
65 70 75 80 

Leu Leu Leu Ala Leu Leu Val Leu Pro Leu Phe Val Tyr Ser Glu Val 
85 90 95 

Gin Gly Gly Ala Trp Leu Leu Ser Pro Arg Leu Cys Asp Ala Leu Met 
100 105 HO 

Ala Met Asp Val Met Leu Cys Thr Ala Ser He Phe Asn Leu Cys Ala 
115 120 125 

He Ser Val Asp Arg Phe Val Ala Val Ala Val Pro Leu Arg Tyr Asn 
130 135 140 

Arg Gin Gly Gly Ser Arg Arg Gin Leu Leu Leu He Gly Ala Thr Trp 
145 150 155 160 

Leu Leu Ser Ala Ala Val Ala Ala Pro Val Leu Cys Gly Leu Asn Asp 
165 170 175 

Val Arg Gly Arg Asp Pro Ala Val Cys Arg Leu Glu Asp Arg Asp Tyr 
180 185 190 

Val Val Tyr Ser Ser Val Cys Ser Phe Phe Leu Pro Cys Pro Leu Met 
195 200 205 

Leu Leu Leu Tyr Trp Ala Thr Phe Arg Gly Leu Gin Arg Trp Glu Val 
210 215 220 

Ala Arg Arg Ala Lys Leu His Gly Arg Ala Pro Arg Arg Pro Ser Gly 
225 230 235 240 

Pro Gly Pro Pro Ser Pro Thr Pro Pro Ala Pro Arg Leu Pro Gin Asp 
245 250 255 

Pro Cys Gly Pro Asp Cys Ala Pro Pro Ala Pre Gly Leu Pro Arg Gly 
260 265 270 

Pro Cys Gly Pro Asp Cys Ala Pro Ala Ala Pro Gly Leu Pro Pro Asp 
275 280 285 
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Pro Cyfl Gly Pro Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Gin Asp 
290 295 300 

Pro Cys Gly Pro Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Arg Gly 
305 310 315 320 

Pro Cys Gly Pro Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Gin Asp 
325 330 335 

Pro Cys Gly Pro Asp Cys Ala Pro Pro Ala Pro Gly Leu Pro Pro Asp 
340 345 350 

Pro Cys Gly Ser Asn Cys Ala Pro Pro Asp Ala Val Arg Ala Ala Ala 
355 360 365 

Leu Pro Pro Gin Thr Pro Pro Gin Thr Arg Arg Arg Arg Arg Ala Lys 
370 375 380 

lie Thr Gly Arg Glu Arg Lys Ala Met Arg Val Leu Pro Val Val Val 
385 390 395 400 

Gly Ala Phe Leu Leu Cys Trp Thr Pro Phe Phe Val Val His He Thr 
405 410 415 

Gin Ala Leu Cys Pro Ala Cys Ser Val Pro Pro Arg Leu Val Ser Ala 
420 425 430 

Val Thr Trip Leu Gly Tyr Val Asn Ser Ala Leu Thr Pro Val lie Tyr 
435 440 . 445 

Thr Val Phe Asn Ala Glu Phe Arg Asn Val Phe Arg Lys Ala Leu Arg 
450 455 460 

Ala Cys Cys 
465 
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WHAT WE CLAIM IS: 

1. A DNA sequence comprising a nucleotide sequence encoding a 
mammalian dopamine receptor, wherein the mammalian dopamine receptor has the 
drug dissociation properties of the human dopamine receptor D4. 
5 2, The DNA sequence of Claim 1 wherein the mammalian dopamine 

receptor encoded is the human D4 dopamine receptor. 

3. The DNA sequence of Claim 1 wherein the mammalian dopamine 
receptor encoded therein has the drug dissociation properties described in Table 1. 

4. The DNA sequence of Claim 1 wherein the mammalian dopamine 
10 receptor encoded therein has a high affinity for the drug clozapine. 

5. The DNA sequence of Claim 1 comprising a repeated DNA 
sequence that is substantially homologous to the sequence: 

5'-A CCC GCG CCC CGC CTC CCC CAG GAC CCC TGC GGC CCC GAC 
TGTGCGCC-3' 

15 6. The DNA sequence of Claim 5 comprising from about 2 to about 8 

copies of the repeated DNA sequence. 

7. The DNA sequence of Claim 5 having a sequence substantially 
homologous to allele D4.2 of the human D4 dopamine receptor gene [SEQ ID No.: 
17]. 

20 8. The DNA sequence of Claim 5 having a sequence substantially 

homologous to allele D4.4 of the human D4 dopamine receptor gene [SEQ ID No. : 
19]. 

9. The DNA sequence of Claim 5 having a sequence substantially 
homologous to allele D4.7 of the human D4 dopamine receptor gene [SEQ ID No. : 

25 21]. 

10. A homogeneous composition of a 41 kilodalton human dopamine 
receptor D4 or derivative thereof, wherein the amino acid sequence of the 
dopamine receptor or derivative thereof is substantially homologous to the 
sequence in Figure 3. 

30 11. The homogeneous composition of Claim 10 wherein the amino acid 

sequence of the dopamine receptor or derivative thereof comprises a repeated 
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amino acid sequence that is substantially homologous to the sequence: 
(P/A) AP(R/G)LP(Q/R/P)(D/G)PCG(P/S)(D/N)CAP 
12. The amino acid sequence of Claim 11 comprising from about 2 to 
about 8 copies of the repeated amino acid sequence. 
5 13. The amino acid sequence of Claim 11 having a sequence 

substantially homolgous to the amino acid sequence encoded by allele D4.2 of the 
human D4 dopamine receptor gene [SEQ ID No. : 18]. 

14. The amino acid sequence of Claim 11 having a sequence 
substantially homolgous to the amino acid sequence encoded by allele D4.4 of the 

10 human D4 dopamine receptor gene [SEQ ID No. : 20] . 

15. The amino acid sequence of Claim 11 having a sequence 
substantially homolgous to the amino acid sequence encoded by allele D4.7 of the 
human D4 dopamine receptor gene [SEQ ID No.: 22]. 

16. A recombinant DNA construct comprising a nucleotide sequence 
IS encoding the human dopamine receptor D4. 

17. A recombinant expression construct comprising the DNA sequence 
of Claim 2, wherein the construct is capable of expressing the human dopamine 
receptor D4 in a transformed eukaryotic cell culture. 

18. The recombinant expression vector of Claim 17 wherein the DNA 
20 sequence comprises a repeated DNA sequence that is substantially homologous to 

the sequence: 

S'-A CCC GCG CCC CGC CTC CCC CAG GAC CCC TGC GGC CCC GAC 
TGTGCGCC-3' 

19. The DNA sequence of Claim 18 comprising from about 2 to about 
25 8 copies of the repeated DNA sequence. 

20. The DNA sequence of Claim 18 having a sequence substantially 
homologous to allele D4.2 of the human D4 dopamine receptor gene [SEQ ID No. : 
17]. 

21. The DNA sequence of Claim 18 having a sequence substantially 
30 homologous to allele D4.4 of the human D4 dopamine receptor gene [SEQ ID No. : 

19]. 
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. 22. The DNA sequence of Claim 18 having a sequence substantially 
homologous to allele D4.7 of the human D4 dopamine receptor gene [SEQ ID No. : 
21]. 

23. A eukaryotic cell culture transformed with the recombinant 
5 expression construct of Claim 17, wherein the transformed eukaryotic cell culture 

is capable of expressing the human dopamine receptor D4. 

24. A method of screening a compound as an inhibitor of dopamine 
binding to the human dopamine receptor D4, the method comprising the following 
steps: 

10 (a) transforming a eukaryotic cell culture with an expression vector as 

in Claim 1 7 capable of expressing the human dopamine receptor D4 
in a eukaryotic cell; and 

(b) assaying for ability of the compound to inhibit the binding of a 
detectable dopamine analog. 

IS 25. A method of screening a compound for anti-psychotic activity, the 

method comprising the following steps: 

(a) transforming a eukaryotic cell culture with an expression vector as 
in Claim 17 capable of expressing the human dopamine receptor D4 
in a eukaryotic cell; 

20 (b) assaying for ability of the compound to inhibit the binding of a 

detectable dopamine analog; and 

(c) testing those drugs for anti-psychotic activity based on their affinity 
for the D4 dopamine receptor. 

26. A method of quantitatively detecting a compound as an inhibitor of 
25 dopamine binding to the human dopamine receptor D4, the method comprising the 
following steps: 

(a) transforming a eukaryotic cell culture with an expression vector as 
in Claim 17 capable of expressing the human dopamine receptor D4 
in a eukaryotic cell; and 
30 (b) assaying for amount of a compound by measuring the extent of 

inhibition of binding of a detectable dopamine analog. 
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27. The method of Claim 26 wherein the compound to be tested is 
present in a human. 

28. The method of Claim 26 wherein the compound is present in human 

blood. 

5 29. The method of Claim 26 wherein the compound is present in human . 

cerebrospinal fluid. 

30. The method of Claim 26 wherein the compound is present in human 

brain, 

31. The method of Claim 26 wherein the compound is unknown. 

10 32. A method for detecting a restriction fragment length polymorphism 

in a gene encoding a D4 dopamine receptor in a human comprising the following 
steps: 

(a) isolating a sufficient quantity of DNA from the human; 

(b) digesting the DNA with a first restriction enzyme that is Pstl 
15 and a second restriction enzyme that is Hincll to produce a 

multiplicity of fragments of digested DNA; 

(c) analyzing the fragments of digested DNA by hybridization 
with a probe comprising the nucleic acid sequence of Claim 
2; and 

20 (d) detecting a pattern of the hybridized fragments of the human 

dopamine receptor gene. 
33. A method for screening a population of humans to determine the 
frequency of restriction fragment length polymorphism of a gene encoding a D4 
dopamine receptor comprising the following steps: 
25 (a) detecting a pattern of DNA fragments of the gene encoding 

a dopamine receptor in each individual human DNA sample 
according to the method of Claim 32; 
(b) comparing the patterns detected in the DNA of the 
population of humans with the patterns of a representative 
30 panel of restriction fragment length polymorphisms in a 

human D4 dopamine receptor gene present in humans; and 
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(c) computing the frequency of each particular restriction 
fragment length polymorphism of a dopamine receptor gene 
in humans. 

34. A method for determining the presence of a restriction fragment 
5 length polymorphism in a gene encoding a dopamine receptor in an individual 

human comprising the following steps: 

(a) detecting a pattern of DNA fragments of a dopamine 
receptor gene in the individual human according to the 
method of Claim 32; and 
10 (b) comparing the pattern detected in the DNA of an individual 

human with the patterns of a representative panel of 
restriction fragment length polymorphisms in a human 
dopamine receptor gene. 

35. A method for identifying a human target population for 
15 administration of a therapeutic drug for the prevention or alleviation of disease 

states in a human related to a human D4 dopamine receptor comprising the 
following steps: 

(a) detecting a pattern of DNA fragments of a human dopamine 
receptor gene in the individual human according to the 

20 method of Claim 32; 

(b) comparing the pattern detected in the DNA of each 
individual human with the patterns of a representative panel 
of restriction fragment length polymorphisms in a human 
dopamine receptor gene; 

25 (c) identifying the individual humans who are members of the 

target population expressing the appropriate pattern of 
restriction fragment length polymorphisms in a human 
dopamine receptor gene; and 

(d) treating the members of the human target population 
30 expressing the appropriate pattern of restriction fragment 

length polymorphisms in the a human dopamine receptor 
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gene with the therapeutic drug for the prevention or 
alleviation of disease states in a human related to a human 
D4 dopamine receptor. 
36/ A reagent for detecting a restriction fragment length polymorphism 
5 in a human D4 dopamine receptor gene comprising the nucleic acid sequence of 
Claim 2. 

37. A method for detecting alleles of a gene encoding a D4 dopamine 
receptor in a human comprising the following steps: 

(a) isolating a sufficient quantity of DNA from the human; 
10 (b) amplifying in vitro DNA comprising a polymorphic region 

of the D4 dopamine receptor gene; 

(c) detecting a pattern of amplified DNA fragments of the D4 
dopamine receptor gene; and 

(d) identifying the alleles of the D4 dopamine receptor gene 
IS corresponding to the amplified DNA fragments detected. 

38. A method for screening a population of humans to determine the 
frequency of alleles of a gene encoding a D4 dopamine receptor comprising the 
following steps: 

(a) detecting a pattern of amplified DNA fragments of the D4 
20 dopamine receptor gene in each individual human DNA 

sample according to the method of Claim 37; 

(b) identifying the alleles of the D4 dopamine receptor gene 
corresponding to the patterns of amplified DNA fragments 
detected in the DNA of the population of humans; and 

25 (c) computing the frequency of each allele of the D4 dopamine 

receptor gene in the human population screened. 

39. A method for determining a genotype of D4 dopamine receptor 
alleles in an individual human comprising the following steps: 

(a) detecting a pattern of amplified DNA fragments of the D4 
30 dopamine receptor gene in the individual human according 

to the method of Claim 37; and 
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(b) identifying the alleles of the D4 dopamine receptor gene 
corresponding to the patterns of amplified DNA fragments 
detected in the DN A of the individual human. 
40. A method for identifying a human target population for 
5 administration of a therapeutic drug for the prevention or alleviation of disease 
states in a human related to a human D4 dopamine receptor comprising the 
following steps: 

(a) detecting a pattern of amplified DNA fragments of a human 
dopamine receptor gene in the individual human according 

10 to the method of Claim 39; 

(b) identifying the alleles comprising a genotype of the D4 
dopamine receptor gene corresponding to the patterns of 
amplified DNA fragments detected in the DNA of the 
individual human; 

15 (c) identifying the individual humans who are members of the 

target population having the appropriate genotype of the D4 

dopamine receptor gene; and 
(d) treating the members of the human target population having 

the appropriate genotype of the D4 dopamine receptor gene 
20 with the therapeutic drug for the prevention or alleviation of 

disease states in a human related to a human D4 dopamine 

receptor. 
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S # -w.,w« gg*cc«gggteeg9ceggggcgtgcceeerogg»9gg*ctcceegg^t0cceeeeggcrttrtccgc77ci 

cteagcgeecgcccgggcgegee AM 060 AAC osc MO ACC oce oac 6C0 GAC ooo cto cto ecr coo coc 

Mat Oly Aaa Arg Sar Tax Ala Aip Ala Acp Gly L«u Ua Ala Cly JLr*, 
A m 

600 CCO GCC GCO 000 OCX TCTCCOGGOCCATCTGCOGGCeTOCCTGflOCAOGCCgCGCCOGCGCTC 
01/ tro JUm Ala 01/ Ala fa* JUa 61f Ala far Ala Gly u* Ala OXy 61a 61/ Ui Ala Ala Ut- 
ile 

CTO CCO GCC OTO C90 CTC ATC OCC GCO CTO CTC OCO GCO AAC TCO CTC GTO TCC CTO AGC GTO GCC 
TaX Oly Oly Tal Ui U* Xla Oly AXa TaX U« Ala Oly Aaa far Uo Tal Cya TaX far Tal All 

24( 

ACC GAG CGC~tCC CTOCAOACOCCCACCAACfCCTTCATCCTOAOCCTOCCOGCCCCCGACCTCCTC 
Th* Ola Axg Ua Ua Ola Sat Fro Tax Aaa far Paa Zla TaX Sax Lav Ala Ala Ala Arp Uu La« 

CTCGCTCTCCTOGTGCTOCCGCTCTTCCTCTACTCCGAO gtgagccgegtecggecgea 

Lao Ala Uu Uo TaX Uo tea bant taa TaX Tye Sax OXa 

. . .ceegtggtgtegcegcgcag OTC CAO GOT OGC 6CO TOO CTO CTO AGC CCC CGC CTO TCC GAC GCC CTC 

TaX OXa OXy OXy Ala Trp Uo baa lax t«o Axg Am Cya JUp Ala Lata 

391 

ATO GCC ATO GAC OTC ATO CTO TOO ACC GCC TCC ATC TTC AAC CTO TGC GCC ATC AGC GTG GAC AG 
Mat Ala Kat A*p TaX Mat Laa Cya Ttox AXa Sac ZXa taa Aaa Am Cya AXa Ua Sac Tal Aap Axg 

gtgegccgcceteecc^cccgcgccccygcgcceccgcgceecgccegcegcceeeaecgeggceegtgcgctgtceggcgcecect 

43C 

cggcgctCCCCgcag GTTCGTOGCCGTOGCCCTOCCOCTOCOCtACAAeCOOCAO COT CCO ACC CSC 

tha TaX AXa TaX AXa TaX fro In Axg Tyx Aaa Axg OXa Oly OXy Sac Axg 

O 3X1 

COO CAO CTO CTO CTC ATC GOCOCCACOTOOCTOCTOTCCGCO OCO CTO CCO CCO CCC GTA CTO TCC 
Axg OXa Uo Uu Ua XXa OXy AXa Tax Tip Ua Ua Sax AXa AXa TaX AXa AXa Tea TaX Ua cyi 

£k2c^aac£^^ 

OXy Uo Aen Axp TaX Axg OXy Axg Axp Ho Ala TaX Cya Axg lao Ola Acp Acg Acp Tye TaX Tal 

§3IC£T^ 

Tyx Sac Sac TaX Cya Sax tha tha Ua fro Cya tea Ua Nat taa Ua Ua Tyr Txp AXa Tax ta< 

JCCC^CCC r .CTC k atft:COC^YGO*CM^ 

Arg OXy Ua OXa Axg Trp Ola TaX AXa Axg Axg AXa Aya Ua 11a OXy Axg AXa tea Axg Arg tc< 

5»ic^iccT^coc3.^ 

Sac OXy Tea OXy tea tea Sax tea Tax rro^Tr© Ala tea Axg Uo tea Ola Axp tea Cya Oly tec 

^CJk^i^^at^w^GCaidX^^ teeceggggeecotgeggee. eetgtgcgccccecgegeccggee 

Aap Cya Ala tea tea AXa tea OXy too 



ceeecaggacccctgcggccccgactgtgegcccccegegcccggcet 




ggfrcccjicccfqcsocg ra 

Ala f ei^te^AipjS^aXl^ me Axg Ax g Ac 

aco^cOTyogc^juygjj^ gtg 
Axg Axg AXa Ays XXa t^e OXy Axg Ola Axg Ays AXa Mat Axg TaX Aaa tea TaX TaX TaX 

gttcctgtcctgaggggcggggaggagaggagggggggigtacgaggccggctgggcggggggcgctaacgcggctctcggcgccc 
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Specific [ 3 H] Spiperone Bound, pM 
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